ELITRA.006A PATENT

CHITOBIASE AS A REPORTER ENZYME 

GOVERNMENT INTEREST IN THE INVENTION 
Work performed in developing the present invention was supported, in part, by 
National Science Foundation grant MCB-9507209, NIH MBRS grant GM45765, and
NIH NIGMS MARC F31 GM14967-0451. Accordingly, the United States Government 
may have certain rights in the invention. 

Related Applications 
This application claims priority to U.S. Provisional Patent Application Serial 
Number 60/159,221, filed October 13, 1999, the disclosure of which is incorporated
herein by reference in its entirety. 



Background of the Invention 

Field of the Invention 

Reporter enzymes are enzymes whose activities are easily assayed when present
inside cells. In order to study the regulation of a gene whose expression is regulated by
various environmental and/or cellular factors or influences, a gene encoding a reporter
enzyme may be fused to the coding region or to the regulatory region of the regulated
gene.

One of the most popular cytoplasmic reporter enzymes for use in bacteria is
β-galactosidase. It is widely used in the art; however, because bacteria such as
Escherichia coli contain an endogenous β-galactosidase-encoding gene and
β-galactosidase may be present in the cytoplasm of such bacteria, deletions of the lacZ
gene, the source of the enzyme, must be introduced into the host cell line prior to its
use. One goal of the present invention is to provide an alternative intracellular enzyme
for use as a reporter.

This invention relates to genetic constructs and methods of using a cytoplasmic 
form of the chitobiase enzyme as a reporter. The invention also comprises expression 
vectors which express the cytoplasmic form of chitobiase. As used herein, all instances
of the terminology "chitobiase" refer to a form of chitobiase which is present in the
cytoplasm of the cell. Cytoplasmic forms of chitobiase may be generated via genetic
engineering or microbial selection techniques. 

Summary of the Invention
One embodiment of the present invention is a method for characterizing a 
promoter comprising providing a construct comprising the promoter operably linked to 
a nucleic acid encoding a cytoplasmic form of chitobiase, introducing the construct into 
host cells, and identifying sequences in the promoter which regulate transcription levels.
In one aspect of this embodiment, the cytoplasmic form of chitobiase lacks a signal 
sequence. In another aspect of this embodiment, the nucleic acid encoding a 
cytoplasmic form of chitobiase encodes a fusion protein which comprises a cytoplasmic
form of chitobiase fused to a heterologous polypeptide. In still another aspect of this
embodiment, the nucleic acid encoding a cytoplasmic form encodes a cytoplasmic form
of chitobiase obtained from an organism selected from the group consisting of 
Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos taurus,
Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba
histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania
donovani, Mus musculus, Pisum sativum, Porphyromonas gingivalis,
Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces
plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio
furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and Vibrio vulnificus. In a further
aspect of this embodiment, the method of identifying sequences which are involved in
directing transcription comprises mutagenizing the promoter. In another aspect of this
embodiment, the method of identifying sequences which are involved in transcription
comprises constructing deletions in the promoter.

Another embodiment of the present invention is a method for identifying a 
regulatory element capable of directing or regulating transcription within a test nucleic
acid sequence comprising providing a construct comprising the test nucleic acid
sequence operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase,
introducing the construct into host cells, and determining the level of chitobiase activity. 
In one aspect of this embodiment, the cytoplasmic form of chitobiase lacks a signal 
sequence. In another aspect of this embodiment, the nucleic acid encoding a
cytoplasmic form of chitobiase encodes a fusion protein, the fusion protein comprising
a cytoplasmic form of chitobiase fused to a heterologous polypeptide. In still another
aspect of this embodiment, the nucleic acid encoding a cytoplasmic form encodes a
cytoplasmic form of chitobiase obtained from an organism selected from the group
consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx
mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium
discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat cats,
Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa,
Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus. In one aspect of this embodiment, the reporter gene construct is
introduced transiently. In another aspect of this embodiment, the reporter gene
construct is introduced stably. The host cells may be selected from the group consisting
of prokaryotic cells and eukaryotic cells. In another aspect of this embodiment, the 
method further comprises permeabilizing or lysing the host cells. The permeabilizing 
or lysing step may comprise treating the host cells with toluene. The step of
determining the level of chitobiase activity may be selected from the group consisting
of measuring the amount of a chemiluminescent product produced from a substrate,
measuring the amount of a fluorescent product produced from a substrate, measuring 
the amount of light absorbed by a product produced from a substrate and measuring a
decrease in the amount of a detectable substrate. In another embodiment, the step of
determining the level of chitobiase activity may comprise determining the level of
p-nitrophenol released from a substrate. In another aspect of this embodiment, the test
nucleic acid sequence comprises a portion of genomic DNA. In a further aspect of this
embodiment, the step of determining the level of chitobiase activity comprises
determining the level of chitobiase activity after exposing the host cells to a desired set
of environmental conditions. In still another aspect of this embodiment, the step of
determining the level of chitobiase activity comprises determining the level of
chitobiase activity after contacting the host cells with a compound to be tested for its
influence on the level of transcription from said regulatory element.

Another embodiment of the present invention is a method of detecting
successful transformation, comprising the steps of introducing a nucleic acid encoding a
cytoplasmic form of chitobiase into host cells, and detecting chitobiase expression in
the host cells.

Another embodiment of the present invention is a fusion protein-reporter gene
construct comprising a promoter operably linked to a nucleic acid encoding a 
cytoplasmic form of chitobiase fused in frame with a nucleic acid encoding a 
heterologous polypeptide, wherein the heterologous polypeptide is not β-galactosidase
or a portion thereof, and wherein the heterologous polypeptide does not contain a signal
peptide. In one aspect of this embodiment, the nucleic acid encodes a cytoplasmic form
of chitobiase obtained from an organism selected from the group consisting of
Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos taurus,
Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba
histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania
donovani, Mus musculus, Pisum sativum, Porphyromonas gingivalis,
Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces
plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio
furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and Vibrio vulnificus. In another
aspect of this embodiment, the nucleic acid further comprises a λ site-specific
recombination sequence.

Another embodiment of the present invention is a reporter gene construct 
comprising plasmid pJMF3. 

Another embodiment of the present invention is a reporter gene construct
comprising plasmid pJMF4.

Another embodiment of the present invention is a reporter gene construct 
comprising plasmid pDYK9. 

Another embodiment of the present invention is a reporter gene construct 
comprising plasmid pDYK11.

Another embodiment of the present invention is a host cell comprising a fusion
protein-reporter gene construct comprising a promoter operably linked to a nucleic acid
encoding a cytoplasmic form of chitobiase fused in frame with a nucleic acid encoding
a heterologous polypeptide, wherein the heterologous polypeptide is not β-galactosidase
or a portion thereof, and wherein the heterologous polypeptide does not contain a signal
peptide. In one aspect of this embodiment, the nucleic acid is integrated into a
chromosome of the cell. In another aspect of this embodiment, the nucleic acid is
transiently expressed in the host cell.

Another embodiment of the present invention is a nucleic acid encoding a
cytoplasmic form of chitobiase in which the signal sequence of native chitobiase has 
been inactivated or deleted. In one aspect of this embodiment, the signal sequence has 
been mutated to inactivate it. 

Another embodiment of the present invention is an isolated or purified
polypeptide comprising a cytoplasmic form of chitobiase
fused in frame with a heterologous polypeptide, wherein the heterologous polypeptide is
not β-galactosidase or a portion thereof and wherein the heterologous polypeptide does
not contain a signal peptide.

Another embodiment of the present invention is an isolated or purified
polypeptide comprising a cytoplasmic form of chitobiase in which the signal peptide of
native chitobiase has been inactivated or deleted. In one aspect of this embodiment, the
signal sequence has been mutated to inactivate it. 

Another embodiment of the present invention is a method for monitoring the 
activity of a promoter comprising providing a construct comprising the promoter
operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase,
introducing the construct into host cells, and determining the level of chitobiase activity.
In one aspect of this embodiment, the cytoplasmic form of chitobiase lacks a signal
sequence. In another aspect of this embodiment, the nucleic acid encoding a
cytoplasmic form of chitobiase encodes a fusion protein, the fusion protein comprising
a cytoplasmic form of chitobiase fused to a heterologous polypeptide. In one aspect of
this embodiment, the nucleic acid encoding a cytoplasmic form encodes a cytoplasmic 
form of chitobiase obtained from an organism selected from the group consisting of 
Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx mori, Bos taurus,
Caenorhabditis elegans, Candida albicans, Dictyostelium discoideum, Entamoeba
histolytica, Felis catus, Homo sapiens, Korat cats, Lactobacillus casei, Leishmania
donovani, Mus musculus, Pisum sativum, Porphyromonas gingivalis,
Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia marcescens, Streptomyces
plicatus, Streptomyces thermoviolaceus, Sus scrofa, Trichoderma harzianum, Vibrio
furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and Vibrio vulnificus. In another
aspect of this embodiment, the reporter gene construct is introduced transiently. In still
another aspect of this embodiment, the reporter gene construct is introduced stably. In a
further aspect of this embodiment, the host cells are selected from the group consisting
of prokaryotic cells and eukaryotic cells. In yet another aspect of this embodiment, the
method further comprises permeabilizing or lysing the host cells. For example, the 
permeabilizing or lysing step may comprise treating the host cells with toluene. In yet 
another aspect of this embodiment, the step of determining the level of chitobiase
activity may be selected from the group consisting of measuring the amount of a
chemiluminescent product produced from a substrate, measuring the amount of a
fluorescent product produced from a substrate, measuring the amount of light absorbed
by a product produced from a substrate and measuring a decrease in the amount of a
detectable substrate. In still another aspect of this embodiment, the step of determining
the level of chitobiase activity comprises determining the level of p-nitrophenol
released from a substrate. In still another aspect of this embodiment, the step of
determining the level of chitobiase activity comprises determining the level of
chitobiase activity after exposing the host cells to a desired set of environmental
conditions. In a further aspect of this embodiment, the step of determining the level of
chitobiase activity comprises determining the level of chitobiase activity after
contacting the host cells with a compound to be tested for its influence on the level of
transcription from said regulatory element. For example, the compound may comprise a
compound to be tested for activity as a drug.

Brief Description of the Drawings 

Figure 1 illustrates plasmids pJMF3 and pJMF4 containing attP in 2 different
orientations and the lac promoter with the first 21 amino acids of lacZα (from pUC19)
fused in-frame to the chb gene. The sequence of the fusion region is shown in Figure 2.
Restriction sites shown are found once in the plasmid sequences except for NotI, which
has 2 sites flanking the P15A origin.

Figure 2 illustrates the sequence of the lac promoter and the chitobiase fusion
found in pJMF3 and pJMF4 (SEQ ID NOS: 15 and 16). Fusion between lacZα (from
pUC19) and chb [Soto-Gil, R.W. et al., J. Biol. Chem. 264:14778-14782 (1989)] is
indicated by (/); start of transcription is indicated by (+1). Sequence and binding sites
in the lac promoter regulatory region are found in Dickson, R.C. et al., Science 187:27-32
(1975), the disclosure of which is incorporated herein by reference in its entirety.
Restriction enzyme sites shown in Figure 2 are found once in the plasmid sequences
except for SphI, which has 2 sites; these different restriction sites can be used to replace
the lac promoter with another promoter together with part of a coding region to produce
an in-frame fusion with chb.

Figure 3 depicts integration of chb fusions into the chromosome by site-specific 
recombination between attB and attP. The steps involved are described below and in
Diederich, L., et al., Plasmid 28:14-24 (1992), the disclosure of which is incorporated
herein by reference in its entirety.

Figure 4 illustrates plasmids pDYK9 and pDYK11 which contain dnaA-chb
fusions. pDYK9 is deleted for the rpmH regulatory region. The orientation of attP in
pDYK9 and pDYK11 is the same as that of pJMF3. After integration at attB of the
larger NotI fragment, the dnaA promoters are oriented to transcribe in the same
direction as replication fork movement.

Figure 5 illustrates the rpmH-dnaA regulatory region and dnaA-chitobiase fusion
(SEQ ID NO: 17). The dnaA box and promoters are shaded, and the coding region of
the dnaA gene is striped. The fusion contains two amino acids (between the
backslashes) from pUC19. The region cloned into pDYK9 is between SphI primer II
and KpnI primer, and the region cloned into pDYK11 is between SphI primer I and
KpnI primer. The numbers above the primers refer to the nucleotide in the sequence
amplified. For sequence numbering and the locations of promoters and protein binding
sites, see Froelich, J.M. et al., J. Bacteriol. 178:6006-6012 (1996); Hansen, F.G. et al.,
EMBO J. 1:1043-1048 (1982) and Messer, W. and C.W. Weigel, Initiation of
Chromosome Replication, p. 1579-1601 (1996) In F.C. Neidhardt, R. Curtiss III, J.L.
Ingraham, E.C.C. Lin, K.B. Low, B. Magasanik, W.S. Reznikoff, M. Riley, M.
Schaechter and H.E. Umbarger (Eds.), Escherichia coli and Salmonella: Cellular and
Molecular Biology, ASM Press, Washington, D.C., the disclosures of which are
incorporated herein by reference in their entireties.

Detailed Description of the Preferred Embodiment
Reporter genes and reporter gene constructs play a number of important roles in 
a variety of molecular biology techniques. For example, reporter genes may be used to 
determine whether a sequence contains a promoter or other cis-acting element which 
directs transcription, such as an enhancer. In addition, reporter genes may be used to 
identify regulatory sites in promoters or other cis-acting elements and to determine the
effects of mutating these regulatory sites on the level of gene expression directed by the
promoters or other cis-acting elements. Reporter genes may also be used to detect
successful transformation. In addition, reporter genes may be used to monitor gene
expression under various conditions and to identify drugs.

The structure of a reporter gene construct containing the cytoplasmic form of 
chitobiase will vary according to its purposes. When the reporter gene construct is a 
vector, one must decide between a vector that incorporates the reporter gene into the 
host's genome or one that replicates extrachromosomally, such as a plasmid. When the 
reporter construct is not designed to integrate into the host genome, the vector can 
contain an origin of replication with activity in the host cell of interest. This feature 
provides the reporter gene vector the ability to replicate within the host cell in which it 
has been introduced. In addition to the origin of replication, reporter constructs often 
contain a promoter, a multiple cloning site, a selectable marker, and of course a reporter 
gene. Reporter constructs for use in eukaryotic cells may also contain a polyA site 
adjacent to the reporter gene. 

Given the utility of reporter gene constructs, it is not surprising that a number of 
cytoplasmic reporter gene constructs and different reporter genes are available for use 
by those of skill in the art. For example, the cytoplasmic reporter enzymes 
chloramphenicol acetyltransferase (CAT), firefly luciferase, β-glucuronidase (GUS),
green fluorescent protein (GFP), and β-galactosidase have been used extensively.
However, such reporters all have individual shortcomings that may limit or preclude 
their usage under some conditions. For example, high levels of GFP are toxic to the 
cell. In addition, reporter enzymes are not expressed equally in all cell types nor are 
they equally stable when expressed in all cell types. Furthermore, there is a recognized 
need for multiple reporter enzymes that can be assayed independently of one another in 
order to simultaneously study the regulation of multiple genes within a single cell type. 
Therefore, there exists a continuing need to identify reporter enzymes with useful 
properties. 

The cytoplasmic enzyme β-galactosidase is widely used as a reporter in
various microbiological and molecular biological studies. This enzyme is used in both
in vitro and in vivo assays. The wide acceptance of this reporter system results, in part,
because it is non-isotopic and extremely flexible. It is used in a number of assay
formats and has an extremely broad linear range. Nevertheless, because β-galactosidase
is present in the cytoplasm of various host cells such as Escherichia coli, deletion of the
lacZ gene, the source of the enzyme, is often required prior to its use in a host cell
system. One goal of the present invention was to provide an alternative intracellular
enzyme for use as a reporter.

An extensive discussion of various molecular biology techniques is available in 
Ausubel, et al. (eds.), "Short Protocols in Molecular Biology," Wiley and Sons, Inc.,
New York (1997), the disclosure of which is incorporated herein by reference in its
entirety. Examples of such techniques include isolating and preparing DNA for
manipulation, gel electrophoresis, polymerase chain reaction (PCR), determining
nucleic acid sequences, screening nucleic acid libraries, mutagenesis of DNA, and
introducing DNA into host cells.

The structure of a reporter gene construct will vary according to its purposes.
The reporter gene constructs are constructed according to standard techniques of
molecular biology well known in the art.

When the reporter gene construct is a vector, one must decide between a vector
that incorporates the reporter gene into the host's genome or one that replicates
extrachromosomally, such as a plasmid. When integration of the reporter gene is a
desired result, a reporter gene construct will contain sequences that will facilitate
incorporation.

One example of integration sequences that can be included in a reporter gene 
construct is the λ attP site. This site permits a single copy of the reporter gene to be
incorporated into a host bacterial genome. Integration-promoting sequences with utility
in mammalian cells include the long terminal repeats found in retroviral genomes.
These sequences promote viral genome integration in a host genome and have been
used extensively by those of skill in the art to promote the integration of exogenous
sequences in mammalian host cells.

When the reporter construct is not designed to integrate into the host genome, it
is common that the vector contain an origin of replication with activity in the host cell
of interest. This feature provides the reporter gene vector the ability to replicate within
the host cell in which it has been introduced.

In addition to the origin of replication, reporter constructs will often contain
additional features that promote the expression of the nucleic acid sequence or
sequences contained in the construct. These additional sequences can include a polyA
site, a multiple cloning site, a drug resistance marker, and of course a reporter gene.

The present invention relates to the use of chitobiase as a reporter gene. The 
chitobiase may be used as a reporter in bacteria, plants, mammalian cells and other host 
cell lines. One possible alternative to using β-galactosidase as a reporter gene was to
develop cytoplasmic N,N'-diacetylchitobiase (N-acetyl-β-D-glucosaminidase, EC
3.2.1.30) for use as a reporter enzyme. One advantage of the enzyme
N-acetyl-β-D-glucosaminidase or "chitobiase" over β-galactosidase is that genes encoding chitobiase
are missing from many bacteria, including E. coli, some fungi, and some eukaryotic
cells. Thus, it is not necessary to engineer many host cells to lack reporter activity as is
the case with β-galactosidase. The present invention also relates to various protein
expression vectors that can be used to express the reporter gene. In addition, the present
invention may be used in conjunction with other reporter enzymes in systems in which
the regulation or activities of multiple genes are to be studied simultaneously.

Chitobiase is one of two enzymes that hydrolyze chitin, an abundant insoluble 
polysaccharide, to its monomeric unit, N-acetylglucosamine (GlcNAc). Chitobiase is
known to be present in a number of organisms. For example, the chitobiase enzyme is
known to be present in various genera including Arabidopsis, Bacillus, Bombyx, Bos,
Caenorhabditis, Candida, Dictyostelium, Entamoeba, Felis, Homo, Korat,
Lactobacillus, Leishmania, Mus, Pisum, Porphyromonas, Pseudoalteromonas, Rattus,
Serratia, Streptomyces, Sus, Trichoderma, and Vibrio. Specific examples of organisms
known to contain chitobiase include Alteromonas sp. O-7, Arabidopsis thaliana,
Bacillus subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans,
Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat
cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa,
Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus.

One source of the enzyme is the marine bacterium, Vibrio harveyi. Escherichia
coli cells harboring a plasmid carrying the chb gene from Vibrio harveyi were reported 
to produce the enzyme, which was found to be associated with the outer membrane of 
the bacterial cells. These are described in Jannatipour, M. et al., "Translocation of
Vibrio harveyi N,N'-diacetylchitobiase to the outer membrane of Escherichia coli," J.
Bacteriol. 169:3785-3791 (1987) and Soto-Gil & Zyskind, "N,N'-Diacetylchitobiase of
Vibrio harveyi: primary structure, processing, and evolutionary relationships," J. Biol.
Chem. 264:14778-14782 (1989), both of which are hereby incorporated by reference.

The present invention contemplates expressed cytoplasmic forms of chitobiase
in various forms. In one embodiment, the signal sequence is deleted from the amino
terminal portion of the protein. Presumably the removal of this sequence results in the
expression of a cytoplasmic form of the enzyme that is not secreted from the host cell or
incorporated into the membrane of the host cell producing the enzyme.

The present invention also contemplates the generation of fusion proteins
comprising a fusion polypeptide joined in frame to chitobiase. Preferably, the fusion
polypeptide comprises a polypeptide other than chitobiase, such as a heterologous
protein. The heterologous polypeptide may comprise a polypeptide having a biological
activity (such as an enzymatic or other activity besides activity as an immunogen) or the
heterologous polypeptide may not have a biological activity. The heterologous
polypeptide does not include a signal sequence which directs its secretion. Preferably
the heterologous polypeptide is not β-galactosidase or a portion thereof. Thus, the
fusion reporter gene construct contains a sequence encoding the fusion polypeptide
genetically fused in frame with a sequence encoding chitobiase. In one embodiment,
this fusion may remove the amino-terminal signal peptide sequence of chitobiase and
replace it with a heterologous protein.

In another embodiment, the fusion protein construct comprises a chitobiase gene 
sequence that has been truncated to remove at least the signal peptide sequence of the 
gene. Alternatively, mutations may be introduced into the signal peptide sequence so 
that it is no longer functional. Such mutations may be introduced using a variety of 
techniques familiar to those skilled in the art, including site directed mutagenesis,
cassette mutagenesis, and chemical mutagenesis. 
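
The truncation and in-frame fusion described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the construction actually used for any plasmid of the invention: the function names, the toy sequences, and the choice of four signal codons are all hypothetical, and a real design would take the signal peptide boundary from the published chb sequence.

```python
# Hypothetical sketch: drop the signal-peptide codons from a chitobiase coding
# sequence and join the remainder in frame to a heterologous N-terminal partner.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a coding sequence into codons; reject lengths not divisible by 3."""
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def fuse_in_frame(partner_cds, chb_cds, signal_codons):
    """Return partner + chitobiase lacking its first `signal_codons` codons."""
    partner = codons(partner_cds)
    chb = codons(chb_cds)
    # A stop codon in the partner would terminate translation before chitobiase.
    if any(c in STOP_CODONS for c in partner):
        raise ValueError("partner fragment contains an in-frame stop codon")
    if not 0 <= signal_codons < len(chb):
        raise ValueError("signal_codons out of range")
    mature = chb[signal_codons:]  # remove the signal-peptide codons
    return "".join(partner + mature)


# Toy sequences for illustration only (not real lacZ or chb sequence).
partner = "ATGACCATGATT"
chb = "ATGAAAAAACTGGCTGCAGATCAGTAA"
fusion = fuse_in_frame(partner, chb, signal_codons=4)
```

Working codon by codon, rather than slicing by nucleotide offsets, makes it hard to introduce a frameshift by accident, which is the main failure mode when a fusion junction is designed by hand.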

Once the reporter gene construct is made it is introduced into a host cell line for
testing. Host cells of prokaryotic and eukaryotic origin can be used with the reporter 
gene constructs of the present invention. A variety of methods are available to 
introduce reporter gene constructs into host prokaryotic cells. For example, bacteria can 
be transformed using calcium chloride, electroporation, or viral vectors such as the 
filamentous phages. These and other prokaryotic transformation protocols are well 
known in the art. 

Alternatively, the sequence encoding chitobiase may be introduced in eukaryotic 
cells, including yeast, mammalian, plant, and insect cells. For example, the sequence 
encoding chitobiase may be inserted into a yeast artificial chromosome, a yeast plasmid, 
a bovine papilloma virus vector or other extrachromosomal vector, a retroviral vector, a 
Ti-plasmid, or a baculovirus vector. A variety of such vectors are known to those 
skilled in the art. The vectors may be introduced into any of the yeast, mammalian, 
plant, and insect cells familiar to those skilled in the art. 

The introduction of the reporter gene construct into mammalian cells can
likewise utilize a number of transfection protocols well known to those of skill in the 
art. As discussed above, transfections can be transient or stable. Examples of suitable 
transfer protocols include calcium phosphate transfection, DEAE-Dextran, 
electroporation, liposome-mediated transfection, and viral transfection. These and other 
eukaryotic transformation protocols are well known in the art. 

Following introduction of the reporter gene construct into the host cell of 
interest, the enzymatic activity of the reporter gene is measured. Preferably, the
chitobiase assays are performed after permeabilizing or lysing the host cells. There are
a variety of cell permeabilization and cell lysis procedures available to those of ordinary
skill in the art, including methods such as sonication or lysozyme treatment. One
embodiment of the present invention uses toluene treatment to permeabilize cells. The
details of this method are discussed in D.Y. Kalabat et al., BioTechniques 25:1030-1035
(1998) and Miller, J.H., A Short Course in Bacterial Genetics, CSH Laboratory Press,
Cold Spring Harbor, NY 1992, the disclosures of which are incorporated herein by
reference in their entireties. 

Cellular chitobiase activity can be measured quantitatively by following the 
hydrolysis of chitobiase substrates. Examples of substrates with utility in chitobiase 
activity assays include N,N'-diacetylchitobiose (chitobiose), p-nitrophenyl-N-acetyl-β-D-glucosaminide
(PNAG) (Sigma Chemical, St. Louis, MO), and
5-bromo-4-chloro-3-indolyl-N-acetyl-β-D-glucosaminide (X-Gluc) (Sigma Chemical, St. Louis, MO). Other
substrates are also contemplated for use in the assays of the present invention. 

Products produced by the hydrolysis of the chitobiase substrates are monitored 
using various means familiar to those skilled in the art. For example, various optical 
means are known to those skilled in the art. One such optical means may comprise 
detection of chemiluminescent or fluorescent products released from a substrate. 
Alternatively, the level of chitobiase activity may be determined by measuring the 
amount of light absorbed by a product produced from a substrate or measuring a 
decrease in the amount of a detectable substrate. In one embodiment, p-nitrophenol is
released from the substrate and measured at 400 nm. Other monitoring methods well
known in the art can be used to quantitate signals produced in the chitobiase assay.
These may include use of radioactive substrates or substrates having radiofrequency
tags. In another embodiment, blue/white colony indicator plates are used to monitor 
enzyme activity. 
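
As an illustration of how an absorbance reading at 400 nm might be converted into a normalized activity value, the following Python sketch applies a Miller-style normalization of the kind commonly used for β-galactosidase assays. The formula, the function name, and the example readings are assumptions made for illustration; the text above does not specify a unit definition, and the cited assay protocols should be consulted for the actual calculation.

```python
# Hypothetical sketch: normalize p-nitrophenol absorbance (400 nm) to reaction
# time, culture volume assayed, and cell density, in the style of Miller units.
def chitobiase_units(a400, minutes, volume_ml, culture_od):
    """Return 1000 * A400 / (time in min * volume in ml * culture density).

    This normalization is an assumption borrowed from beta-galactosidase
    assays; it omits any correction for light scattering by cell debris.
    """
    if minutes <= 0 or volume_ml <= 0 or culture_od <= 0:
        raise ValueError("time, volume and culture density must be positive")
    if a400 < 0:
        raise ValueError("absorbance cannot be negative")
    return 1000.0 * a400 / (minutes * volume_ml * culture_od)


# Made-up example readings: a 10 minute reaction on 0.1 ml of culture.
units = chitobiase_units(a400=0.6, minutes=10, volume_ml=0.1, culture_od=0.5)
```

Dividing by time, volume, and culture density lets readings from cultures harvested at different densities, or assayed for different lengths of time, be compared on one scale.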

Another embodiment of the present invention is a kit. One aspect of this 
embodiment includes a reporter gene construct comprising a vector containing a 
chitobiase reporter gene. The reporter gene construct also contains a multiple cloning 
site containing a variety of restriction endonuclease cutting sites that facilitate the 
introduction of exogenous DNA into the construct. 

The kit embodiment of the present invention also includes those components 
necessary to assay for chitobiase activity produced by the reporter gene construct. For 
example, in one embodiment, the kit will include a supply of a suitable chitobiase 
substrate whose metabolism into product by the reporter enzyme can be assayed. 

EXAMPLES 

The following Examples are disclosed to assist in the understanding of the 
present invention. The Examples below should not be construed to limit the scope of
the invention, and such variations of the invention, now known or later developed, which
would be within the purview of one of ordinary skill in the art are considered to fall
within the scope of the invention hereinafter claimed.

Example 1
Construction of Vectors 
Vectors were constructed using polymerase chain reaction (PCR) products that
were cloned first at the SmaI or EcoRV restriction sites of plasmid pBluescript II
(pKSII+) (Stratagene; San Diego, CA). The nucleotide sequence of all PCR products
was determined using standard techniques well known in the art. Unmethylated
plasmid DNA was isolated from an E. coli dam⁻ strain when cutting with the BclI
enzyme was required.

The pDYK9 (SEQ ID NO: 11) plasmid was constructed by ligating an SphI-KpnI
PCR product containing the dnaA promoter region into plasmid pRSG196 that contains
the V. harveyi chitobiase gene (see Jannatipour, M., et al.). Briefly, the V. harveyi
chitobiase gene was prepared by cloning a Sau3A partial digest of V. harveyi DNA into
the single BamHI site within the tet gene of pMK2004 [See Soto-Gill & Zyskind,
"Cloning of Vibrio harveyi chitinase and chitobiase genes in Escherichia coli," pp. 209-
223, In J.P. Zikakis (ed.), "Chitin, chitosan, and related enzymes," Academic Press, Inc.,
New York (1984)]. The clones containing the gene of interest were detected by the
presence of the yellow p-nitrophenylate product after individual colonies of the clone
bank were sprayed with 10 mM PNAG dissolved in 100 mM sodium phosphate, pH 7.0.
Restriction maps of the positive clones were made according to standard techniques
well known in the art.

The chb gene was subcloned in a 3.5-kb EcoRI fragment from pRSG14 [See
Jannatipour, M. et al., "Translocation of Vibrio harveyi N,N'-diacetylchitobiase to the
outer membrane of Escherichia coli," J. Bacteriol. 169:3785-3791 (1987)] into the
EcoRI site of pUC19 [Yanisch-Perron, C., et al., Gene 33:103-119 (1985)]. Plasmid
pRSG196 was constructed by deleting a 0.5-kb SstI fragment from one of these clones
[See Jannatipour, M. et al., "Translocation of Vibrio harveyi N,N'-diacetylchitobiase to
the outer membrane of Escherichia coli," J. Bacteriol. 169:3785-3791 (1987)].

The pRSG196 plasmid was also cut with SphI and KpnI to accommodate the
insertion of the dnaA fragment. Plasmid pAC17 [described in Chiaramello & Zyskind,
"Coupling of DNA replication to growth rate in Escherichia coli: a possible role for
guanosine tetraphosphate," J. Bacteriol. 172:2013-2019 (1992), hereby incorporated by
reference] served as template with primers 5'-GCA CAT GCA TGC TGG TCA TTA
AAT TTT CC-3' (SEQ ID NO 1) and 5'-CGG GGT ACC AAC TCA TCC TGC AAT
CG-3' (SEQ ID NO 2) producing a PCR product 374 bp long that contains 353 bp from
the dnaA promoter region [bases 583 to 935, numbering according to Hansen, F. G., et
al., "The nucleotide sequence of the dnaA gene promoter and of the adjacent rpmH
gene, coding for the ribosomal protein L34, of Escherichia coli," EMBO J. 1:1043-1048
(1982)]. The forward primer (SphI primer II) contained an SphI site and the reverse
primer (KpnI primer) contained a KpnI site for cloning. This created an in-frame fusion
between the amino-terminal 17 amino acids of DnaA and the carboxy-terminal end of
chitobiase deleted for the amino-terminal 22 amino acids including the signal peptide.

The next step in constructing the pDYK9 vector was taking the 3270 bp DraI-
HindIII (partial digest) fragment containing the dnaA-chb fusion and ligating it to the
pACYC184 2555 bp HincII-HindIII fragment carrying chloramphenicol-resistance
(Cm^r) and the P15A origin. This fragment was described in Chang & Cohen,
"Construction and characterization of amplifiable multicopy DNA cloning vehicles
derived from the p15A cryptic miniplasmid," J. Bacteriol. 134:1141-1156 (1987).

A NotI site was introduced at the AccI site after digestion with AccI, treatment
with Mung bean nuclease, and ligation to phosphorylated NotI linkers (New England
Biolabs, Inc.; Beverly, MA). A NotI site was introduced at the AseI site after digestion
with AseI, treatment with Mung bean nuclease, and ligation to phosphorylated NotI
linkers (New England Biolabs, Inc.; Beverly, MA). An XbaI-SphI PCR product
containing the rrnBt1t2 terminator was ligated into this plasmid cut with the same
enzymes creating pDYK7. E. coli chromosomal DNA served as template with primers
5'-CTA GTC TAG ATG CCG AAC TCA GAA GTG A-3' (SEQ ID NO 3) and 5'-GCA
CAT GCA TGC GGG GGA TGG CTT GTA GAT-3' (SEQ ID NO 4) to produce a PCR
product 357 bp long that contains bases 6534 to 6869 from the rrnB operon [numbering
according to Brosius, J., et al., "Gene organization and primary structure of a ribosomal
RNA operon from Escherichia coli," J. Mol. Biol. 148:107-127 (1981)] and includes
the complex transcription termination region of this operon which is described in Orosz,
A., et al., "Analysis of the complex transcription termination region of the Escherichia
coli rrnB gene," Eur. J. Biochem. 201:653-659 (1991). The forward primer contained
an XbaI site and the reverse primer contained an SphI site.

A BclI-SmaI PCR product containing the λ attP site was ligated into pDYK7 cut
with Tth111I, treated with Mung bean nuclease, then digested with BclI. Plasmid
pHN894 [described in Goodman, S. D., et al., "Deformation of DNA during site-
specific recombination of bacteriophage lambda: Replacement of IHF protein by HU
protein or sequence-directed bends," Proc. Natl. Acad. Sci. USA 89:11910-11914 (1992)]
served as template with the forward primer 5'-CAT GAT CAT GCG ACA GGT TTG
ATG A-3' (SEQ ID NO 5) and the reverse primer 5'-GGG GGC GCC TAG CTT TGA
GGA G-3' (SEQ ID NO 6) producing a PCR product 466 bp long that contains the λ
attP site. The 466 bp PCR product containing the λ attP site was first cloned into the
SmaI site of the pKSII+ plasmid to produce pDYK8. The forward primer contains a
BclI site and the reverse primer contains G's at the 5' end in order to recreate a SmaI
site when cloning the PCR fragment into a SmaI site. This PCR product includes bases
-211 to +241 from the center of the attP core and the sequence required for optimum λ
attP site integration as described in Nagaraja & Weisberg, "Specificity determinants in
the attachment sites of bacteriophages HK022 and λ," J. Bacteriol. 172:6540-6550
(1990). The orientation of attP is such that when the fusion is integrated at attB, the
transcription direction of dnaA p1 and p2 promoters is the same as replication fork
movement, mimicking the orientation at the wild type dnaA promoters.
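
The restriction sites attributed to the primers above can be checked directly against the primer sequences. The following is a minimal sketch, not part of the original disclosure: the primer sequences are copied from SEQ ID NOs 1 to 5 as given in the text, the recognition sequences for SphI, KpnI, XbaI, and BclI are the standard ones, and the names RECOGNITION_SITES, PRIMERS, and site_position are illustrative.

```python
# Standard recognition sequences for the enzymes named in the text.
RECOGNITION_SITES = {
    "SphI": "GCATGC",
    "KpnI": "GGTACC",
    "XbaI": "TCTAGA",
    "BclI": "TGATCA",
}

# Primer sequences as printed in the text, paired with the site each is said to carry.
PRIMERS = {
    "SEQ ID NO 1": ("GCA CAT GCA TGC TGG TCA TTA AAT TTT CC", "SphI"),
    "SEQ ID NO 2": ("CGG GGT ACC AAC TCA TCC TGC AAT CG", "KpnI"),
    "SEQ ID NO 3": ("CTA GTC TAG ATG CCG AAC TCA GAA GTG A", "XbaI"),
    "SEQ ID NO 4": ("GCA CAT GCA TGC GGG GGA TGG CTT GTA GAT", "SphI"),
    "SEQ ID NO 5": ("CAT GAT CAT GCG ACA GGT TTG ATG A", "BclI"),
}


def site_position(primer, enzyme):
    """Return the 0-based index of the enzyme's site in the primer, or -1 if absent."""
    sequence = primer.replace(" ", "").upper()
    return sequence.find(RECOGNITION_SITES[enzyme])


for name, (primer, enzyme) in PRIMERS.items():
    position = site_position(primer, enzyme)
    status = f"found at index {position}" if position >= 0 else "not found"
    print(f"{name}: {enzyme} site {status}")
```

Each of the five primers contains the stated site near its 5' end, which is consistent with the sites having been added as cloning tails.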

Vector pDYK11 (SEQ ID NO: 12) was constructed by ligating an SphI-KpnI
PCR product containing the rpmH-dnaA promoter region into pDYK9 also digested
with SphI and KpnI. Plasmid pAC17, described in Chiaramello & Zyskind, (1992),
served as template with the primers 5'-CAT GGA TGG ATG AAA GGA TGG AGA
GG-3' (SEQ ID NO 7) and 5'-GGG GGT AGG AAG TGA TGG TGG AAT GG-3' (SEQ
ID NO 8) to produce a PCR product 616 bp long that contains 598 bp from the rpmH-
dnaA regulatory region [bases 338 to 935, numbering according to Hansen, F. G., et al.,
(1982)]. The forward primer (SphI primer I) contained an SphI site and the reverse
primer (KpnI primer) contained a KpnI site for cloning. SEQ ID NO: 18 provides the
complete coding sequence of the DNA encoding the dnaA/chitobiase fusion protein.
The complete sequence of the dnaA/chitobiase fusion protein is provided in SEQ ID
NO: 19.
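
The template lengths quoted for the PCR products follow from counting the stated base coordinates inclusively. The sketch below works through that arithmetic for the three products whose coordinates and total lengths are given in the text. Treating the difference between product length and template span as bases contributed by the primer tails is an assumption made here for illustration, and the names span_length and PRODUCTS are hypothetical.

```python
def span_length(first_base, last_base):
    """Number of bases in an inclusive coordinate range."""
    return last_base - first_base + 1


# (first base, last base, PCR product length in bp), all taken from the text.
PRODUCTS = {
    "dnaA promoter (SEQ ID NOs 1 and 2)": (583, 935, 374),
    "rrnB terminator (SEQ ID NOs 3 and 4)": (6534, 6869, 357),
    "rpmH-dnaA region (SEQ ID NOs 7 and 8)": (338, 935, 616),
}

for name, (first, last, product_bp) in PRODUCTS.items():
    template_bp = span_length(first, last)
    extra_bp = product_bp - template_bp  # assumed to come from primer tails
    print(f"{name}: {template_bp} bp template, {extra_bp} bp non-template")
```

The computed spans of 353 bp and 598 bp agree with the figures stated for the dnaA and rpmH-dnaA products.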

The vector pJMF3 (SEQ ID NO: 13) was constructed by first ligating an AseI
linker (5'-CATTAATGGATG-3' (SEQ ID NO 9) self-hybridized) into the SphI site of
pDYK11. The pUC19 [described in Yanisch-Perron C., et al., "Improved M13 phage
cloning vectors and host strains: nucleotide sequences of the M13mp18 and pUC19
vectors," Gene 33:103-119 (1985)] AseI-KpnI fragment containing the lacPO-polylinker
region was ligated to this plasmid after digestion with AseI and KpnI. The resulting in-
frame fusion between the amino-terminal 21 amino acids of the pUC19 lacZ(α) peptide
[Yanisch-Perron C., et al., 1985] and the carboxy-terminal end of chitobiase deleted for
the amino-terminal 22 amino acids is identical to the protein fusion in pRSG196
[Jannatipour, M., et al., (1995)].

Vector pJMF4 (SEQ ID NO: 14) was constructed by ligating the BamHI-
EcoRV fragment of pDYK8 containing the λ attP site into pDYK7 which had been
digested with Tth111I, treated with Mung bean nuclease, then digested with BclI, to
create pTKP9. This reversed the orientation of the attP site relative to the attP site in
pDYK9. The pDYK11 BspUll-KpnI fragment containing the rpmH-dnaA promoter
region was then ligated into pTKP9 digested with the same enzymes to create pTKP11.
This reversed the orientation of the attP site relative to the attP site in pDYK11. An
AseI linker (5'-CATTAATGCATG-3' (SEQ ID NO 10) self-hybridized) was ligated
into the SphI site of pTKP11 to create pJMF2. The pUC19 [as described in Yanisch-
Perron C., et al., (1985)] AseI-KpnI fragment containing the lacPO-polylinker region
was ligated to pJMF2 cut with AseI and KpnI to create pJMF4.

Example 2 
Site-specific Recombination 
To move the chitobiase fusions in pDYK9 and pDYK11 to the attB site in the
chromosome, NotI fragments from these plasmids were self-ligated and transformed or
electroporated into strain WM2269 (DH5α containing pLDR8) [See Zyskind, J. W. and
S. I. Bernstein (1992) "Recombinant DNA Laboratory Manual," Academic Press, San
Diego, CA]. Plasmid pLDR8, described in Diederich, L. L. J., et al., "New cloning
vectors for integration into the λ attachment site attB of Escherichia coli chromosome,"
Plasmid 28:14-24 (1992), expresses integrase from the λ PR promoter and contains the λ
cI857 repressor gene, a kanamycin resistance gene, and a temperature-sensitive origin of
replication. The transformed or electroporated cells were incubated at 42°C with
shaking for 30 min then moved to 37°C for 1 h followed by selection on Luria broth
agar plates containing chloramphenicol (25 µg/ml) at 42°C. Transformants were
screened for loss of kanamycin resistance and, therefore, loss of pLDR8.

Example 3 
Bacteriophage P1 Transduction
Transduction with P1 bacteriophage by the method of Zyskind & Bernstein,
Recombinant DNA Laboratory Manual, Academic Press, San Diego, CA (1992) was
used to construct strains and to confirm the chromosomal location of the dnaA-chb
fusions. Cotransduction of Cm^r (carried by the fusion) and galK (linked to attB)
indicated that Cm^r and galK are linked on the chromosomes of strains DYK9W,
DYK9F, DYK11W, and DYK11F.

Example 4 
p-Nitrophenol Chitobiase Activity Assay 
Chitobiase activity is located in the cytoplasm when its signal peptide is
replaced by fusion with another peptide. Accordingly, chitobiase assays are performed
on permeabilized or lysed cells. A variety of permeabilization or cell lysis protocols are
available to liberate the enzyme from within a cell population being tested. One such
protocol involves toluene-treated cells washed once with M9 salts according to the
method of Miller, J.H. A Short Course in Bacterial Genetics, CSH Laboratory Press,
Cold Spring Harbor, NY 1992, which is hereby incorporated by reference. The
toluenized cells are placed in a chitobiase buffer (10 mM Tris-HCl, pH 8.0, and 0.5 M
NaCl) and 666 µM PNAG, the chitobiase enzyme substrate. NaCl is included because
chitobiase has approximately 80% of full activity in the absence of salt, with maximal
activity occurring between 0.25 and 0.6 M NaCl. Toluenized cells (0.772 ml) are
preincubated at 28°C and the reaction started with the addition of 0.228 ml PNAG (1
mg/ml). After incubation at 28°C, the reaction is stopped by the addition of 1 ml of 1 M
Tris base. The release of p-nitrophenol is measured at 400 nm and turbidity at 550 nm.
p-Nitrophenol release is measured immediately at 400 nm with a molar absorptivity of
1.8 × 10^4 liters mol^-1 cm^-1. Units are calculated after subtracting the light scattering
factor (1.5 × OD550) from OD400 of the sample. The normalizing factor of 1.5 was
determined previously by measuring the light scattering ratio of bacteria at OD400 and
OD550. One unit of chitobiase activity is the amount of enzyme that catalyzes the
formation of 1 pmol of p-nitrophenol per min at 28°C. For comparison to Miller units
of β-galactosidase [described in Miller, J. H., A Short Course in Bacterial Genetics,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1992)], the units are
normalized to 1 ml of culture at OD450 = 1.
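
The unit calculation described in this Example can be sketched as follows. This is an illustrative reading of the protocol, not the inventors' own calculation. It takes the molar absorptivity of p-nitrophenol as 1.8 × 10^4 liters mol^-1 cm^-1 and assumes a 1 cm light path, a final volume of 2 ml (1 ml reaction plus 1 ml of 1 M Tris base), a PNAG molecular weight of 342.3 g/mol (calculated from the formula C14H18N2O8), and normalization by dividing by the product of culture volume and OD450. The function names are hypothetical.

```python
PNP_MOLAR_ABSORPTIVITY = 1.8e4  # liters mol^-1 cm^-1 at 400 nm
SCATTER_FACTOR = 1.5            # light scattering ratio OD400/OD550 of the cells
FINAL_VOLUME_L = 2.0e-3         # assumed: 1 ml reaction + 1 ml of 1 M Tris base
PNAG_MW = 342.3                 # g/mol, assumed from the formula C14H18N2O8


def pnag_micromolar(stock_mg_per_ml=1.0, stock_ml=0.228, reaction_ml=1.0):
    """Final PNAG concentration in the reaction, in micromolar."""
    grams_per_liter = stock_mg_per_ml * stock_ml / reaction_ml  # mg/ml equals g/l
    return grams_per_liter / PNAG_MW * 1e6


def chitobiase_units(od400, od550, minutes, culture_ml, od450, path_cm=1.0):
    """pmol p-nitrophenol per min, normalized to 1 ml of culture at OD450 = 1."""
    corrected = od400 - SCATTER_FACTOR * od550           # remove cell light scattering
    molar = corrected / (PNP_MOLAR_ABSORPTIVITY * path_cm)
    pmol = molar * FINAL_VOLUME_L * 1e12
    return pmol / minutes / (culture_ml * od450)


print(f"PNAG in reaction: {pnag_micromolar():.0f} micromolar")
print(f"Example units: {chitobiase_units(0.50, 0.10, 10.0, 0.772, 0.3):.0f}")
```

Under these assumptions, 0.228 ml of a 1 mg/ml PNAG stock in a 1 ml reaction works out to about 666 micromolar, which matches the substrate concentration stated above. The absorbance readings in the last line are made-up inputs for illustration only.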

Example 5 

Determination of Chitobiase Activity in Cells Containing Vectors

pJMF3 and pJMF4 (Figure 1) contain the lacPO promoter with the first 21 amino acids of lacZα
(from pUC19) fused in-frame to the chb gene. These plasmids also contain the λ phage
attP recombination site in different orientations, the gene encoding chloramphenicol
acetyltransferase (cat), and a ribosomal terminator, rrnBt1t2, inserted upstream of the
lac-chb fusion to prevent read-through from other promoters. The chitobiase activity
associated with these plasmids (Table 1) is high in the absence of IPTG because of
titration of lac repressor expressed from a single copy chromosomal gene. Induction by
IPTG is approximately 10-fold (Table 1).

Table 1. Chitobiase Activity of lacZ-chb Fusion^a

                        Chitobiase^b (U^c)
Plasmid                 -IPTG         +IPTG (1 mM)
pJMF3 in DH5α           668 ± 45      9320 ± 347
pJMF4 in DH5α           788 ± 44      7188 ± 477

^a Overnight cultures were diluted 1:1000 into 50 mL prewarmed LB and grown to
OD450 = 0.1. 1 mM IPTG was added to half of the culture, and growth continued to
OD450 = 0.3.

^b Triplicate samples were assayed. Mean chitobiase activities are given with standard
deviations.

^c One unit of chitobiase activity is 1 pmol of p-nitrophenol/min at 28°C. Units given
for 1 mL of culture at OD450 = 1.
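
The IPTG induction ratios implied by Table 1 can be computed from the mean activities. The short sketch below uses only the mean values reported in the table; the names TABLE_1 and fold_induction are illustrative.

```python
# Mean chitobiase activities (units) from Table 1: (-IPTG, +IPTG).
TABLE_1 = {
    "pJMF3 in DH5a": (668.0, 9320.0),
    "pJMF4 in DH5a": (788.0, 7188.0),
}


def fold_induction(uninduced, induced):
    """Ratio of induced to uninduced activity."""
    return induced / uninduced


for plasmid, (minus_iptg, plus_iptg) in TABLE_1.items():
    print(f"{plasmid}: {fold_induction(minus_iptg, plus_iptg):.1f}-fold")
```

The two ratios bracket the approximately 10-fold induction stated in the text.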



The lac promoter can be replaced with another promoter and a fusion protein
created with chitobiase by cutting with SphI or AseI and either SalI, KpnI, or SstI
(Figure 2). Fusions created with these vectors can be moved to the chromosome by site
specific recombination at the λ attB site to permit single copy analysis of the activity of
the promoter. The protocol, as described more fully in Diederich, et al., "New cloning
vectors for integration into the λ attachment site attB of Escherichia coli chromosome,"
Plasmid 28:14-24 (1992), involves two components: (i) a circular DNA containing the λ
attachment site, attP, the promoter-chb gene fusion, and the cat gene, and (ii) a helper
plasmid, pLDR8, which contains the int gene under the control of the temperature-
sensitive repressor, cI857, and a temperature-sensitive origin of replication. The
plasmid is digested with NotI to remove the P15A origin, and the fragment containing
the chitobiase fusion is self-ligated prior to transformation into cells containing pLDR8.
Integration occurs by site-specific recombination between attP and attB (17.4 min on
the E. coli chromosome, Figure 3).

Example 6 

Use of the Chitobiase Reporter Enzyme to Study dnaA Gene Regulation

Two plasmids, pDYK9 and pDYK11, discussed in the Examples above, were
constructed to assess the regulation of the dnaA gene using chitobiase as a reporter
enzyme. These plasmids differ by the absence of the rpmH regulatory region in
pDYK9 (Figure 4). These fusions were moved from the plasmid to the chromosomal
attB site for single copy analysis as described above. After transformation of strain
WM2269 with the ligated DNA, integration occurred by site specific recombination
between the attP and the attB sites. The orientation of attP in pDYK9 and pDYK11 is
such that when the fusion is integrated at attB, the transcription direction of dnaAp1 and
dnaAp2 promoters is the same as movement of the replication fork. This orientation is
the same as at the dnaA wild type location. The genetic location of the fusions was
confirmed in the Cm^r transformants by demonstrating cotransduction of Cm^r and galK.

The fusions created in strain WM2269 were moved by P1 transduction to
MG1655, creating strain DYK9W with pDYK9 and strain DYK11W with pDYK11.
Deletion of the rpmH promoters had very little effect (1.4-fold) on chitobiase activity
(Table 2, compare lines 1 and 3).

Table 2. Chitobiase and β-Gal Activities of dnaA-chb and dnaA-lacZ Fusions in
WT and fis Mutant Backgrounds

Strains              Chitobiase^a (U^b)    β-Gal^c (Miller U^d)
DYK11W               30.0 ± 2.1
DYK11F fis::985      80.0 ± 1.2
DYK9W                44.0 ± 2.1
DYK9F fis::985       96.0 ± 3.5
RB220                                      59.8 ± 7.9
TP220 fis::767                             115.2 ± 7.2

^a Triplicate samples were assayed during exponential growth. Mean chitobiase
activities are given with standard deviation.

^b One unit of chitobiase activity is 1 pmol of p-nitrophenol per min at 28°C. Units
given for 1 mL of culture at OD450 = 1.

^c Data from Reference 8.

^d Unit defined in Reference 13.



Fis protein binds to a site in the dnaAp2 promoter that covers the -35 sequence
(Figure 5), and appears to be a repressor of DnaA expression. A fusion protein with β-
galactosidase activity that is expressed from the rpmH-dnaA regulatory region has
increased β-galactosidase activity (1.9-fold) in a fis mutant when compared to Fis wild
type cells [as described in Froelich, J. M., et al., "Fis binding in the dnaA operon
promoter region," J. Bacteriol. 178:6006-6012 (1996), data shown in Table 2].
Similarly, the absence of Fis leads to a greater than 2-fold increase in chitobiase activity
of the DnaA-chitobiase fusion protein for the DYK9F and DYK11F strains, comparable
in extent to that observed with the dnaA-lacZ fusion strain, TP220 (Table 2).
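
The fold increases cited here can be reproduced from the means in Table 2. The sketch below also attaches an approximate uncertainty to each ratio using first-order error propagation for independent measurements. That propagation step is an addition made for illustration and is not described in the text; the names WILD_TYPE, FIS_MUTANT, and ratio_with_sd are hypothetical.

```python
import math

# (mean, standard deviation) pairs from Table 2.
WILD_TYPE = {"pDYK11 fusion": (30.0, 2.1), "pDYK9 fusion": (44.0, 2.1), "dnaA-lacZ": (59.8, 7.9)}
FIS_MUTANT = {"pDYK11 fusion": (80.0, 1.2), "pDYK9 fusion": (96.0, 3.5), "dnaA-lacZ": (115.2, 7.2)}


def ratio_with_sd(numerator, denominator):
    """Ratio of two means with a first-order propagated standard deviation."""
    (a, sd_a), (b, sd_b) = numerator, denominator
    ratio = a / b
    sd = ratio * math.sqrt((sd_a / a) ** 2 + (sd_b / b) ** 2)
    return ratio, sd


for fusion in WILD_TYPE:
    ratio, sd = ratio_with_sd(FIS_MUTANT[fusion], WILD_TYPE[fusion])
    print(f"{fusion}: {ratio:.1f}-fold (approx. +/- {sd:.1f}) in the fis mutant")
```

The chitobiase fusions give ratios above 2 and the lacZ fusion gives about 1.9, in line with the comparison made in the paragraph above.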

In the reporter gene constructs discussed in the Example, all upstream 
transcriptional activity was prevented from entering the chb reporter gene. The plasmid 
vectors, pJMF3 and pJMF4, described in the Examples above, contained the rrnBt1t2
terminator upstream of the promoter fusion, which prevented readthrough from
chromosomal promoters near the insertion site. Only chitobiase activity originating
from the promoters of interest was expressed.

The attP site in these vectors allowed integration at the chromosomal attB in a
specific orientation, depending on the vector used. With these vectors, any chitobiase 
fusion involving an essential gene can be moved to the chromosome, thus permitting 
single copy analysis with a chromosomal orientation similar to the wild-type gene. 

Example 7 

Identification of Promoters in Test Sequences 
A nucleic acid prospectively containing a promoter is inserted upstream of a 
nucleic acid encoding a cytoplasmic form of chitobiase as described above. For example, 
the nucleic acid prospectively containing a promoter may be inserted into a restriction site 
in a sequence containing a plurality of restriction sites, such as a polylinker, which is 
located upstream of the nucleic acid encoding chitobiase. The test sequence may comprise 
any nucleic acid to be tested for promoter activity. In one embodiment, the test sequence 
may comprise a genomic DNA sequence. For example, the genomic DNA sequence may 
be a randomly generated DNA fragment, such as a fragment generated using shotgun 
cloning techniques, a restriction fragment, or any other sequence. 

The vectors containing the test sequence upstream of the nucleic acid encoding
chitobiase are introduced into an appropriate host cell. The level of chitobiase activity is
assayed and compared to the level obtained from a control vector which lacks an insert in
the cloning site. The presence of an elevated expression level in cells containing the vector
with the insert, relative to the level in cells containing the control vector without
the insert, indicates the presence of a promoter in the insert.
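
The comparison against the insert-free control vector can be expressed as a simple screen. The following is a minimal sketch under stated assumptions: the 2-fold cutoff (min_fold), the clone names, and the activity values are all hypothetical and chosen only to illustrate the logic; the text does not specify a threshold.

```python
def promoter_candidates(activities, control_activity, min_fold=2.0):
    """Return {clone: fold over control} for clones at or above the assumed cutoff."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    hits = {}
    for clone, activity in activities.items():
        fold = activity / control_activity
        if fold >= min_fold:
            hits[clone] = fold
    return hits


# Made-up chitobiase activities (units) for three hypothetical test clones.
example_activities = {"clone_A": 25.0, "clone_B": 410.0, "clone_C": 38.0}
hits = promoter_candidates(example_activities, control_activity=20.0)
print(hits)
```

In practice the cutoff would be chosen with the assay's replicate variability in mind, since a fixed fold threshold is only one of several reasonable ways to define "elevated".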

In some embodiments, the activity of the promoter in the test sequence may be 
assayed after exposure of the host cells to conditions which may influence the level of
transcription from the promoter. For example, the environment of the host cells may be
altered to determine whether the transcription level is influenced by environmental factors,
including factors such as temperature, pH, nutrients, or availability of oxygen. In such
analyses, chitobiase levels are assayed under a variety of environmental conditions to
determine the effects of the environmental conditions on transcription levels from the
promoter. In addition, the activity of the promoters may be examined in the presence or 
absence of compounds to be tested for regulatory activity. For example, the activity of the 
promoters may be tested by determining the levels of chitobiase produced in the presence 
or absence of compounds to be tested for activity as drugs.

Promoter sequences within the test sequences may be further defined by
constructing nested deletions in the test sequences using conventional techniques such as
Exonuclease III digestion. The resulting deletion fragments can be inserted into the
promoter reporter vector to determine whether the deletion has reduced or obliterated
promoter activity as determined by measuring chitobiase activity in cells containing the
deletion vectors. In this way, the boundaries of the promoters may be defined. If desired,
potential individual regulatory sites within the promoter may be identified using
techniques such as site directed mutagenesis or linker scanning to obliterate potential
transcription factor binding sites within the promoter individually or in combination. The
effects of these mutations on transcription levels may be determined by inserting the
mutations into the cloning sites in the promoter reporter vectors and measuring the levels
of chitobiase produced from the mutated promoters.
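
One way to read a promoter boundary out of a nested deletion series is to look for the pair of adjacent deletion endpoints between which activity is lost. The sketch below assumes that each deletion is described by the position of its retained upstream end relative to the transcription start (negative values are upstream), and that "lost" means falling below half of the full-length activity. Both conventions, the function name, and the example values are hypothetical.

```python
def promoter_boundary(deletions, full_length_activity, min_fraction=0.5):
    """Locate the interval in which promoter activity is lost.

    deletions: iterable of (upstream_endpoint, activity) pairs.
    Returns (last_active_endpoint, first_inactive_endpoint), or None if no
    adjacent pair crosses the cutoff.
    """
    cutoff = min_fraction * full_length_activity
    ordered = sorted(deletions)  # most upstream endpoint first
    for (pos_a, act_a), (pos_b, act_b) in zip(ordered, ordered[1:]):
        if act_a >= cutoff and act_b < cutoff:
            return pos_a, pos_b
    return None


# Made-up deletion series: endpoint position and chitobiase activity (units).
series = [(-400, 100.0), (-200, 95.0), (-100, 90.0), (-50, 12.0), (-20, 3.0)]
print(promoter_boundary(series, full_length_activity=100.0))
```

With these illustrative numbers the upstream boundary would be placed between positions -100 and -50, and a finer deletion series across that interval would narrow it further.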

The activity of known promoters may also be monitored by operably linking them
to a nucleic acid encoding a cytoplasmic form of chitobiase. The activity of the promoters
may be analyzed under various environmental conditions as described above. In addition,
the activity of the promoters may be analyzed in the presence or absence of compounds to
be tested for the ability to affect transcription from the promoters. For example, the
compounds may be tested for activity as drugs.

In some embodiments, the chitobiase reporter constructs may be used in systems
for identifying compounds that modulate cell surface protein-mediated activity or
compounds which modulate the activities of intracellular signaling systems. Techniques
for using reporter genes to identify compounds which modulate cell surface protein-
mediated activity have been described in U.S. Patent Number 5,401,629 and U.S. Patent
Number 5,436,128, the disclosures of which are incorporated herein by reference in their
entireties. Briefly, in such methods, a construct comprising a promoter operably linked to
a nucleic acid encoding a reporter enzyme is introduced into cells which express the cell
surface protein and cells which do not express the cell surface protein. Each of the cells
is contacted with test compounds and the effects of these compounds on transcription
levels are measured by determining the level of activity of the reporter enzyme. The level
of expression of the reporter gene in cells expressing the cell surface protein is compared
to the level in cells which do not express the cell surface protein to identify compounds
that modulate cell surface protein activity.

Similarly, the chitobiase reporter constructs may be used to identify compounds
which influence the activity of intracellular signaling pathways, such as cAMP-based or
phosphorylation-based pathways. In such methods, a promoter which is activated via such
pathways is operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase.
The cells are contacted with test compounds. Those compounds which activate the
pathway to which the promoter responds will produce an enhanced level of chitobiase
activity in the cells as compared to the level of chitobiase activity in control cells which
have not been contacted with the test compound.
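
The screening logic described in the two preceding paragraphs can be sketched as a pair of comparisons. The example below is illustrative only: the 2-fold response cutoff, the function names, and the activity values are assumptions and do not come from the text or from the cited patents.

```python
def responds(treated, untreated, min_fold=2.0):
    """True if reporter activity changes up or down by at least min_fold."""
    fold = treated / untreated
    return fold >= min_fold or fold <= 1.0 / min_fold


def modulates_surface_protein(expressing, non_expressing, min_fold=2.0):
    """Each argument is a (treated_activity, untreated_activity) pair.

    A compound is flagged when the reporter responds in cells expressing the
    cell surface protein but not in cells lacking it.
    """
    return responds(*expressing, min_fold=min_fold) and not responds(
        *non_expressing, min_fold=min_fold
    )


# Made-up chitobiase activities (units) for three hypothetical compounds.
print(modulates_surface_protein((300.0, 100.0), (105.0, 100.0)))  # receptor-dependent increase
print(modulates_surface_protein((300.0, 100.0), (310.0, 100.0)))  # increase in both cell types
print(modulates_surface_protein((40.0, 100.0), (98.0, 100.0)))    # receptor-dependent decrease
```

For the intracellular signaling pathway screen, the `responds` check alone, applied to treated versus untreated cells, corresponds to the comparison against control cells described above.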

Example 8 

Detecting Successful Transformation or Transfection Using Chitobiase

A vector comprising a sequence encoding a cytoplasmic form of chitobiase
operably linked to a sequence capable of directing transcription of the chitobiase gene is
introduced into a host cell. The host cells are contacted with a chitobiase substrate and
those host cells which contain chitobiase activity are identified as cells which were
successfully transformed or transfected. In some embodiments, a portion or replica of a
colony may be lysed or permeabilized beforehand, and the lysate or permeabilized cells
may then be contacted with the chitobiase substrate.

CONCLUSION 

New gene reporter systems that use chitobiase have been described. Chitobiase
has advantages over other reporter gene systems in that chitobiase is not found in many
cell lines traditionally used in reporter gene systems.

Finally, the foregoing examples are not intended to limit the scope of the present
invention, which is set forth in the following claims. In particular, various equivalents
and substitutions will be recognized by those of ordinary skill in the art in view of the
foregoing disclosure, and these are contemplated to be within the scope of the present
invention. All references cited herein are incorporated herein by reference in their
entireties.

WHAT IS CLAIMED IS:

1. A method for characterizing a promoter comprising:

providing a construct comprising said promoter operably linked to a
nucleic acid encoding a cytoplasmic form of chitobiase;
introducing the construct into host cells; and
identifying sequences in said promoter which regulate transcription
levels.

2. The method of Claim 1, wherein said cytoplasmic form of chitobiase lacks a 
signal sequence. 

3. The method of Claim 2, wherein said nucleic acid encoding a cytoplasmic 
form of chitobiase encodes a fusion protein, said fusion protein comprising a 
cytoplasmic form of chitobiase fused to a heterologous polypeptide. 

4. The method of Claim 1, wherein said nucleic acid encoding a cytoplasmic
form encodes a cytoplasmic form of chitobiase obtained from an organism selected
from the group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus
subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans,
Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat
cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa,
Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus.

5. The method of Claim 1, wherein said method of identifying sequences which
are involved in directing transcription comprises mutagenizing said promoter.

6. The method of Claim 1, wherein said method of identifying sequences which
are involved in transcription comprises constructing deletions in said promoter.

7. A method for identifying a regulatory element capable of directing or 
regulating transcription within a test nucleic acid sequence comprising: 

providing a construct comprising said test nucleic acid sequence 
operably linked to a nucleic acid encoding a cytoplasmic form of chitobiase; 
introducing said construct into host cells; and 
determining the level of chitobiase activity.

8. The method of Claim 7, wherein said cytoplasmic form of chitobiase lacks a
signal sequence. 

9. The method of Claim 8, wherein said nucleic acid encoding a cytoplasmic 
form of chitobiase encodes a fusion protein, said fusion protein comprising a 
cytoplasmic form of chitobiase fused to a heterologous polypeptide. 

10. The method of Claim 7, wherein said nucleic acid encoding a cytoplasmic
form encodes a cytoplasmic form of chitobiase obtained from an organism selected
from the group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus
subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans,
Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat
cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa,
Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus.

11. The method of Claim 7, wherein said reporter gene construct is introduced
transiently.

12. The method of Claim 7, wherein said reporter gene construct is introduced
stably.

13. The method of Claim 7, wherein said host cells are selected from the group
consisting of prokaryotic cells and eukaryotic cells.

14. The method of Claim 7, further comprising permeabilizing or lysing said 

host cells. 

15. The method of Claim 14, wherein said permeabilizing or lysing step 
comprises treating said host cells with toluene. 

16. The method of Claim 7, wherein said step of determining the level of 
chitobiase activity is selected from the group consisting of measuring the amount of a 
chemiluminescent product produced from a substrate, measuring the amount of a 
fluorescent product produced from a substrate, measuring the amount of light absorbed 
by a product produced from a substrate and measuring a decrease in the amount of a 
detectable substrate.

17. The method of Claim 7, wherein said step of determining the level of
chitobiase activity comprises determining the level of p-nitrophenol released from a
substrate.

18. The method of Claim 7, wherein said test nucleic acid sequence comprises a 
portion of genomic DNA. 

19. The method of Claim 1, wherein said step of determining the level of 
chitobiase activity comprises determining the level of chitobiase activity after exposing 
said host cells to a desired set of environmental conditions. 

20. The method of Claim 7, wherein said step of determining the level of 
chitobiase activity comprises determining the level of chitobiase activity after 
contacting said host cells with a compound to be tested for its influence on the level of
transcription from said regulatory element.

21. A method of detecting successful transformation, comprising the steps of:
introducing a nucleic acid encoding a cytoplasmic form of chitobiase into host 

cells; and 

detecting chitobiase expression in said host cells. 

22. A fusion protein-reporter gene construct comprising a promoter operably 
linked to a nucleic acid encoding a cytoplasmic form of chitobiase fused in frame with a 
nucleic acid encoding a heterologous polypeptide, wherein said heterologous 
polypeptide is not β-galactosidase or a portion thereof and wherein said heterologous
polypeptide does not contain a signal peptide. 

23. The nucleic acid of Claim 22, wherein said nucleic acid encodes a 
cytoplasmic form of chitobiase obtained from an organism selected from the group 
consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus subtilis, Bombyx
mori, Bos taurus, Caenorhabditis elegans, Candida albicans, Dictyostelium
discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat cats,
Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa,
Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus.

24. The nucleic acid of Claim 22, further comprising a λ site-specific
recombination sequence.

25. A reporter gene construct comprising plasmid pJMF3.

26. A reporter gene construct comprising plasmid pJMF4.

27. A reporter gene construct comprising plasmid pDYK9.

28. A reporter gene construct comprising plasmid pDYK11.

29. A host cell comprising the construct of Claim 22. 

30. The host cell of Claim 29 wherein said nucleic acid is integrated into a 
chromosome of said cell. 

31. The host cell of Claim 29, wherein said nucleic acid is transiently expressed
in said host cell.

32. A nucleic acid encoding a cytoplasmic form of chitobiase in which the signal 
sequence of native chitobiase has been inactivated or deleted. 

33. The nucleic acid of Claim 32, wherein the signal sequence has been mutated 
to inactivate it. 

34. An isolated or purified polypeptide comprising a cytoplasmic form of 
chitobiase fused in frame with a heterologous polypeptide, wherein said heterologous 
polypeptide is not β-galactosidase or a portion thereof and wherein said heterologous
polypeptide does not contain a signal peptide. 

35. An isolated or purified polypeptide comprising a cytoplasmic form of 
chitobiase in which the signal peptide of native chitobiase has been inactivated or 
deleted. 

36. The polypeptide of Claim 35, wherein the signal sequence has been mutated 
to inactivate it. 

37. A method for monitoring the activity of a promoter comprising: 

providing a construct comprising said promoter operably linked to a 
nucleic acid encoding a cytoplasmic form of chitobiase; 
introducing said construct into host cells; and 
determining the level of chitobiase activity. 

38. The method of Claim 37, wherein said cytoplasmic form of chitobiase lacks 
a signal sequence.

39. The method of Claim 38, wherein said nucleic acid encoding a cytoplasmic
form of chitobiase encodes a fusion protein, said fusion protein comprising a 
cytoplasmic form of chitobiase fused to a heterologous polypeptide. 

40. The method of Claim 37, wherein said nucleic acid encoding a cytoplasmic 
form encodes a cytoplasmic form of chitobiase obtained from an organism selected
from the group consisting of Alteromonas sp. O-7, Arabidopsis thaliana, Bacillus
subtilis, Bombyx mori, Bos taurus, Caenorhabditis elegans, Candida albicans,
Dictyostelium discoideum, Entamoeba histolytica, Felis catus, Homo sapiens, Korat
cats, Lactobacillus casei, Leishmania donovani, Mus musculus, Pisum sativum,
Porphyromonas gingivalis, Pseudoalteromonas sp. S9, Rattus norvegicus, Serratia
marcescens, Streptomyces plicatus, Streptomyces thermoviolaceus, Sus scrofa,
Trichoderma harzianum, Vibrio furnissii, Vibrio harveyi, Vibrio parahaemolyticus, and
Vibrio vulnificus.

41. The method of Claim 37, wherein said reporter gene construct is introduced 
transiently. 

42. The method of Claim 37, wherein said reporter gene construct is introduced
stably.

43. The method of Claim 37, wherein said host cells are selected from the group 
consisting of prokaryotic cells and eukaryotic cells. 

44. The method of Claim 37, further comprising permeabilizing or lysing said
host cells.

45. The method of Claim 44, wherein said permeabilizing or lysing step
comprises treating said host cells with toluene. 

46. The method of Claim 37, wherein the step of determining the level of 
chitobiase activity is selected from the group consisting of measuring the amount of a 
chemiluminescent product produced from a substrate, measuring the amount of a 
fluorescent product produced from a substrate, measuring the amount of light absorbed
by a product produced from a substrate and measuring a decrease in the amount of a 
detectable substrate. 

47. The method of Claim 37, wherein said step of determining the level of 
chitobiase activity comprises determining the level of p-nitrophenol released from a
substrate.

48. The method of Claim 37, wherein said step of determining the level of
chitobiase activity comprises determining the level of chitobiase activity after exposing 
said host cells to a desired set of environmental conditions. 

49. The method of Claim 37, wherein said step of determining the level of
chitobiase activity comprises determining the level of chitobiase activity after 
contacting said host cells with a compound to be tested for its influence on the level of 
transcription from said regulatory element.

50. The method of Claim 49, wherein said compound comprises a compound to 
be tested for activity as a drug.

CHITOBIASE AS A REPORTER ENZYME

Abstract of the Disclosure 
The present invention relates to reporter gene constructs encoding a cytoplasmic 
form of chitobiase (N,N'-diacetylchitobiase) and methods of using these reporter gene 
constructs. The use of a cytoplasmic form of chitobiase as a reporter enzyme is 
generally applicable to the study of gene expression in organisms which do not contain 
N-acetyl-β-D-glucosaminidases.
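
As an illustration only, the following Python sketch shows one way a measured chitobiase activity could be reduced to a comparable number when, as recited in the claims, the level of p-nitrophenol released from a substrate is read as absorbance. The normalization used here (1000 × absorbance / (minutes × culture volume × OD600)) is an assumption borrowed from the Miller-unit convention commonly used for β-galactosidase; it is not stated in this document, and the function and parameter names are hypothetical.

```python
def normalized_activity(a_product, minutes, culture_ml, od600):
    """Miller-style normalization (an assumption, not taken from the patent):
    1000 * absorbance of released product / (time * volume * cell density)."""
    if minutes <= 0 or culture_ml <= 0 or od600 <= 0:
        raise ValueError("minutes, culture_ml and od600 must be positive")
    return 1000.0 * a_product / (minutes * culture_ml * od600)


def fold_change(treated, untreated):
    """Ratio of reporter activity with and without a test compound or condition."""
    if untreated <= 0:
        raise ValueError("untreated activity must be positive")
    return treated / untreated


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    control = normalized_activity(a_product=0.125, minutes=10, culture_ml=0.1, od600=0.5)
    treated = normalized_activity(a_product=0.5, minutes=10, culture_ml=0.1, od600=0.5)
    print(control, treated, fold_change(treated, control))
```

Comparing the normalized value for host cells exposed to a test compound against untreated cells gives a simple fold change in reporter activity; any other consistent normalization would serve equally well.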
Figure 1

SEQ ID NO: 15:

Features labeled in the figure, in order: SphI, AseI, CAP-cAMP binding site, -35, -10, +1, Lac Repressor binding site, SphI, SalI, KpnI, SstI.

GCATGCATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTT

aattgtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcttgcatgcctgcaggtcgact

MetThrMetIleThrProSerLeuHisAlaCysArgSerThr

ctagaggatccccgggtaccg/agctctgagcaacaagttgtaaactcactggctgataaccttgatatccaatatgaa

LeuGluAspProArgValPro/SerSerGluGlnGlnValValAsnSerLeuAlaAspAsnLeuAspIleGlnTyrGlu

MetSerLeuSerLeuTrpGlnGlnCysLeuAlaArgLeuGlnAspGluLeu/ValPro/SerSerGlu...

SEQ ID NO: 17

Figure 5
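
The three-letter translations shown beneath the nucleotide sequence in Figure 1 can be checked against the DNA with a short script. The following Python sketch is illustrative only: it assumes the standard genetic code, takes the two DNA segments from the pJMF3 sequence (SEQ ID NO: 13), and uses hypothetical names (`translate`, `CODON_TABLE`).

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "***",
}


def translate(dna):
    """Translate DNA in frame 1 into concatenated three-letter codes.
    Characters other than a, c, g, t (spaces, '/' fusion marks) are ignored."""
    dna = "".join(ch for ch in dna.lower() if ch in BASES)
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # lacZ alpha start through the polylinker, then the junction into chitobiase.
    print(translate("atgaccatgattacgccaagcttgcatgcctgcaggtcgact"))
    print(translate("ctagaggatccccgggtaccg/agctctgagcaacaagttgtaaactcactggctgataaccttgatatccaatatgaa"))
```

Because the standard table is used, an initiating gtg codon such as the one at the start of the Figure 5 reading frame would be reported as Val rather than Met.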

SEQUENCE LISTING 



<110> Zyskind, Judith W. 

<120> CHITOBIASE AS A REPORTER ENZYME 

<130> ELITRA.006A 

<150> 60/159,221 

<151> 1999-10-13 

<160> 19 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 1 

gcacatgcat gctggtcatt aaattttcc 

<210> 2 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 2 

cggggtacca actcatcctg caatcg 

<210> 3 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 3 

ctagtctaga tgccgaactc agaagtga 

<210> 4 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 4

gcacatgcat gcgggggatg gcttgtagat

<210> 5
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 5 

catgatcatg cgacaggttt gatga 

<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 6 

gggggcgcct acctttcacg ag 

<210> 7 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 7 

catgcatgca tgaaacgatg gacacc 

<210> 8 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 8 

cggggtacca actcatcctg caatcg 

<210> 9 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide
<400> 9

cattaatgca tg

<210> 10 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 10 

cattaatgca tg 12 
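
The numeric identifiers in this listing (<210> sequence number, <211> length, <400> residues) lend themselves to a simple consistency check. The following Python sketch is illustrative only, with hypothetical function names; it parses a few of the oligonucleotide records above and confirms that each residue count matches its <211> value. Digits and spaces on residue lines are discarded, so trailing position counters do not affect the count.

```python
import re

# A few records copied from the listing above (SEQ ID NOs: 1, 2, 8 and 10).
LISTING = """
<210> 1
<211> 29
<212> DNA
<400> 1
gcacatgcat gctggtcatt aaattttcc
<210> 2
<211> 26
<212> DNA
<400> 2
cggggtacca actcatcctg caatcg
<210> 8
<211> 26
<212> DNA
<400> 8
cggggtacca actcatcctg caatcg
<210> 10
<211> 12
<212> DNA
<400> 10
cattaatgca tg 12
"""


def parse_listing(text):
    """Return {seq_id: {"length": int, "residues": str}} from listing text."""
    records, current, in_residues = {}, None, False
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        tag = re.match(r"<(\d{3})>\s*(.*)", line)
        if tag:
            code, value = tag.group(1), tag.group(2)
            if code == "210":  # start of a new sequence record
                current = {"length": None, "residues": ""}
                records[int(value)] = current
                in_residues = False
            elif code == "211" and current is not None:
                current["length"] = int(value)
            elif code == "400":  # residues follow this tag
                in_residues = True
            continue
        if in_residues and current is not None:
            # keep letters only; drops spacing and position counters
            current["residues"] += re.sub(r"[^a-z]", "", line.lower())
    return records


def length_mismatches(records):
    """List the sequence numbers whose residue count differs from <211>."""
    return [k for k, r in records.items() if len(r["residues"]) != r["length"]]


if __name__ == "__main__":
    recs = parse_listing(LISTING)
    print(length_mismatches(recs))
```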

<210> 11 
<211> 5826 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pDYK9 
<400> 11 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840 

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440 

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gctggtcatt aaattttcca 1800 

atatgcggcg taaatcgtgc ccgcctcgcg gcaggatcgt ttacacttag cgagttctgg 1860 

aaagtcctgt ggataaatcg ggaaaatctg tgagaaacag aagatctctt gcgcagttta 1920 

ggctatgatc cgcggtcccg atcgttttgc aggatcttga tcgggcatat aaccgcagac 1980 

agcggttcgt gcgtcaccct caagcagggt cttttcgacg tacgtcaaca atcatgaatg 2040 

tttcagcctt agtcattatc gacttttgtt cgagtggagt ccgccgtgtc actttcgctt 2100 

tggcagcagt gtcttgcccg attgcaggat gagttggtac cgagctctga gcaacaagtt 2160 
gtaaactcac tggctgataa ccttgatatc caatatgaag tgttaactaa ccatggtgct 2220

aacgaaggtc ttgcgtgcca agatatgggc gcagaatggg cttcttgtaa caaagtaaac 2280 

atgacgcttg ttaaccaagg tgaagctgtt gactcaaaag attgggctat ttacttccac 2340 

agcattcgtc tgattctgga tgttgacaac gagcagttca aaatctctcg tgtaacgggt 2400 

gacctacata agctagaacc aacagataag tttgacggct tcgctgccgg tgaagaggtt 2460

gttcttccat tggttggtga atactggcaa ctatttgaaa ctgacttcat gccgggtgca 2520 

ttcgtttctg ctccaaacgc agaacctaag atgattgctt ctctaaatac tgaagatgtt 2580 

gcgtcttttg tgacgggtct tgaaggtaac aacctaaaac gtacaccaga tgacaacaat 2640 

gtatttgcaa acgctgtgtc tcgttttgag aaaaacgaag acctagcaac acaagacgta 2700 

tcaaccacgt tactaccaac accaatgcac gttgaagcgg gtaaaggcaa agtagatatc 2760 

gcggatggta ttgcgctgcc taaagacgca ttcgatgcga ctcagttcgc agcgattcaa 2820 

gatcgtgcag aagtggtagg tgtggacgtt cgtggtgatc ttcctgtaag catcactgtt 2880 

gttcctgcag acttcaccgg tgaattagca aaatctggtg cttacgaaat gagcatcaaa 2940 

ggcgacggta ttgtgattaa agcgttcgac caagcaggcg ctttctacgc agtacaatct 3000 

atctttggcc tggtagatag ccaaaatgct gattctctac cacaactgtc tattaaagat 3060 

gcgcctcgtt ttgattaccg tggtgtgatg gtggatgtgg ctcgtaactt ccactctaag 3120 

gacgcaatcc ttgcaacgct agaccaaatg gcagcgtaca agatgaacaa acttcacctt 3180 

cacctaaccg atgatgaagg ctggcgttta gaaatcccgg gtctgcctga gctgacagaa 3240 

gtgggtgcta accgttgttt cgatacacaa gagaaaagct gtttactgcc tcagcttggc 3300 

tctggtccaa cgacagacaa ctttggctct ggctacttca gcaaagcaga ctacgtggaa 3360 

atcttgaaat acgcgaaagc acgtaacatt gaagtgattc cagaaatcga tatgccagct 3420 

cacgctcgtg cagcagtagt atcaatggaa gctcgttacg accgcctaat ggaagaaggt 3480 

aaagaagctg aagcgaacga ataccgtctg atggatcctc aagatacatc aaacgtaacg 3540 

acggttcagt tctacaataa gcaaagcttc atcaacccat gtatggaatc ttcaactcgc 3600 

tttgttgata aggtgatttc agaagtggca gcaatgcacc aagaagctgg cgctccacta 3660 

acaacttggc acttcggtgg tgacgaagcg aagaacatca agctaggtgc tggtttccaa 3720 

gacgttaacg cagaagataa agtaagctgg aaaggcacga ttgacctgtc taaacaagac 3780 

aagccgtttg cacagtctcc acaatgtcag acgctaatca cagatggcac agtcagtgac 3840 

tttgctcacc taccaagcca cttcgcggaa gaagtgtcga agattgttgc tgagaaaggc 3900 

attccaaact tccaagcttg gcaagatggt ttgaaataca gtgacggcga aaaagcgttc 3960 

gctacagaaa atactcgcgt aaacttctgg gacgttctgt actggggcgg tacttcctca 4020

gtgtacgagt ggtctaagaa aggttacgac gtgattgttt ctaacccaga ttacgtgtac 4080 

atggatatgc catacgaagt tgacccgaaa gagcgtggtt actactgggc aacacgtgca 4140 

acggatactc gtaagatgtt tggctttgca ccagagaaca tgcctcaaaa cgcagaaact 4200 

tctgtagatc gcgatggcaa tggctttact ggtaaaggtg aaatcgaagc gaaacctttc 4260 

tacggtctat ctgcacaact ttggtctgag acagtacgta acgacgagca atacgagtac 4320 

atggtattcc ctcgcgtcct cgctgctgct cagcgtgcat ggcaccgtgc tgactgggaa 4380 

aacgactaca aagttggtgt tgagtactcg caaaactcta atctagttga taaagcatcg 4440 

ctaaaccaag actacaaccg ctttgcgaac gtacttggtc aacgtgaact ggctaagcta 4500 

gaaaaatcag gtattgacta ccgcctacca gtaccaggtg caaaagtaga agatggtaag 4560 

ctagcaatga acgttcagtt ccctggcgta acgcttcaat actctctgga tggtgagaac 4620

tggttgactt atgcagacaa cgctcgtcca aatgtaactg gtgaagtctt catccgctcg 4680

gtatctgcga caggtgagaa ggtaagccgt atcactagcg tgaaataata gcgctcagta 4740 

ttcactaaaa tcatagttcc ttactcaaag ccctcaactt atgttggggg ctttgtttat 4800 

ttttcttcgg aaaataagcg tgatcatgcg acaggtttga tgacaaaaaa ttagcgcaag 4860

aagacaaaaa tcaccttgcg ctaatgctct gttacaggtc actaatacca tctaagtagt 4920

tgattcatag tgactgcata tgttgtgttt tacagtatta tgtagtctgt tttttatgca 4980

aaatctaatt taatatattg atatttatat cattttacgt ttctcgttca gcttttttat 5040 

actaagttgg cattataaaa aagcattgct tatcaatttg ttgcaacgaa caggtcacta 5100 

tcagtcaaaa taaaatcatt atttgatttc aattttgtcc cactccctgc ctctgtcatc 5160 

acgatactgt gatgccatgg tgtccgactt atgcccgaga agatgttgag caaacttatc 5220 

gcttatctgc ttctcataga gtcttgcaga caaactgcgc aactcgtgaa aggtaggcgc 5280 

ccccggtcga atttgctttc gaatttctgc cattcatccg cttattatca cttattcagg 5340 

cgtagcacca ggcgtttaag ggcaccaata actgccttaa aaaaattacg ccccgccctg 5400 

ccactcatcg cagtactgtt gtaattcatt aagcattctg ccgacatgga agccatcaca 5460 

gacggcatga tgaacctgaa tcgccagcgg catcagcacc ttgtcgcctt gcgtataata 5520 

tttgcccatg gtgaaaacgg gggcgaagaa gttgtccata ttggccacgt ttaaatcaaa 5580 
actggtgaaa ctcacccagg gattggctga gacgaaaaac atattctcaa taaacccttt 5640

agggaaatag gccaggtttt caccgtaaca cgccacatct tgcgaatata tgtgtagaaa 5700

ctgccggaaa tcgtcgtggt attcactcca gagcgatgaa aacgtttcag tttgctcatg 5760

gaaaacggtg taacaagggt gaacactatc ccatatcacc agctcaccgt ctttcattgc 5820

catacg 5826



<210> 12 
<211> 6071 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pDYK11
<400> 12 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480 

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840 

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440 

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740 

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gcatgaaacg atggacaccg 1800 

aagccatggg tgattaaaga ggccggattg taataattgt acactccgga gtcaattctc 1860 

tttccttatt taccgcgctt ttccgcacct tttcgcaggg aaaatgtacg acctcacacc 1920 

agtggaaacc agcatggcgc gccgggtgga ggattatacg ggctgatggg taaagcgcaa 1980 

ggatcgtcct ggatctttat tagatcgatt aagccaattt ttgtctatgg tcattaaatt 2040 

ttccaatatg cggcgtaaat cgtgcccgcc tcgcggcagg atcgtttaca cttagcgagt 2100 

tctggaaagt cctgtggata aatcgggaaa atctgtgaga aacagaagat ctcttgcgca 2160 

gtttaggcta tgatccgcgg tcccgatcgt tttgcaggat cttgatcggg catataaccg 2220 

cagacagcgg ttcgtgcgtc accctcaagc agggtctttt cgacgtacgt caacaatcat 2280 

gaatgtttca gccttagtca ttatcgactt ttgttcgagt ggagtccgcc gtgtcacttt 2340

cgctttggca gcagtgtctt gcccgattgc aggatgagtt ggtaccgagc tctgagcaac 2400 

aagttgtaaa ctcactggct gataaccttg atatccaata tgaagtgtta actaaccatg 2460 

gtgctaacga aggtcttgcg tgccaagata tgggcgcaga atgggcttct tgtaacaaag 2520 
taaacatgac gcttgttaac caaggtgaag ctgttgactc aaaagattgg gctatttact 2580

tccacagcat tcgtctgatt ctggatgttg acaacgagca gttcaaaatc tctcgtgtaa 2640

cgggtgacct acataagcta gaaccaacag ataagtttga cggcttcgct gccggtgaag 2700 

aggttgttct tccattggtt ggtgaatact ggcaactatt tgaaactgac ttcatgccgg 2760 

gtgcattcgt ttctgctcca aacgcagaac ctaagatgat tgcttctcta aatactgaag 2820 

atgttgcgtc ttttgtgacg ggtcttgaag gtaacaacct aaaacgtaca ccagatgaca 2880 

acaatgtatt tgcaaacgct gtgtctcgtt ttgagaaaaa cgaagaccta gcaacacaag 2940 

acgtatcaac cacgttacta ccaacaccaa tgcacgttga agcgggtaaa ggcaaagtag 3000 

atatcgcgga tggtattgcg ctgcctaaag acgcattcga tgcgactcag ttcgcagcga 3060 

ttcaagatcg tgcagaagtg gtaggtgtgg acgttcgtgg tgatcttcct gtaagcatca 3120 

ctgttgttcc tgcagacttc accggtgaat tagcaaaatc tggtgcttac gaaatgagca 3180 

tcaaaggcga cggtattgtg attaaagcgt tcgaccaagc aggcgctttc tacgcagtac 3240 

aatctatctt tggcctggta gatagccaaa atgctgattc tctaccacaa ctgtctatta 3300 

aagatgcgcc tcgttttgat taccgtggtg tgatggtgga tgtggctcgt aacttccact 3360 

ctaaggacgc aatccttgca acgctagacc aaatggcagc gtacaagatg aacaaacttc 3420 

accttcacct aaccgatgat gaaggctggc gtttagaaat cccgggtctg cctgagctga 3480

cagaagtggg tgctaaccgt tgtttcgata cacaagagaa aagctgttta ctgcctcagc 3540

ttggctctgg tccaacgaca gacaactttg gctctggcta cttcagcaaa gcagactacg 3600 

tggaaatctt gaaatacgcg aaagcacgta acattgaagt gattccagaa atcgatatgc 3660 

cagctcacgc tcgtgcagca gtagtatcaa tggaagctcg ttacgaccgc ctaatggaag 3720 

aaggtaaaga agctgaagcg aacgaatacc gtctgatgga tcctcaagat acatcaaacg 3780 

taacgacggt tcagttctac aataagcaaa gcttcatcaa cccatgtatg gaatcttcaa 3840 

ctcgctttgt tgataaggtg atttcagaag tggcagcaat gcaccaagaa gctggcgctc 3900 

cactaacaac ttggcacttc ggtggtgacg aagcgaagaa catcaagcta ggtgctggtt 3960 

tccaagacgt taacgcagaa gataaagtaa gctggaaagg cacgattgac ctgtctaaac 4020 

aagacaagcc gtttgcacag tctccacaat gtcagacgct aatcacagat ggcacagtca 4080 

gtgactttgc tcacctacca agccacttcg cggaagaagt gtcgaagatt gttgctgaga 4140 

aaggcattcc aaacttccaa gcttggcaag atggtttgaa atacagtgac ggcgaaaaag 4200 

cgttcgctac agaaaatact cgcgtaaact tctgggacgt tctgtactgg ggcggtactt 4260 

cctcagtgta cgagtggtct aagaaaggtt acgacgtgat tgtttctaac ccagattacg 4320 

tgtacatgga tatgccatac gaagttgacc cgaaagagcg tggttactac tgggcaacac 4380 

gtgcaacgga tactcgtaag atgtttggct ttgcaccaga gaacatgcct caaaacgcag 4440 

aaacttctgt agatcgcgat ggcaatggct ttactggtaa aggtgaaatc gaagcgaaac 4500 

ctttctacgg tctatctgca caactttggt ctgagacagt acgtaacgac gagcaatacg 4560 

agtacatggt attccctcgc gtcctcgctg ctgctcagcg tgcatggcac cgtgctgact 4620

gggaaaacga ctacaaagtt ggtgttgagt actcgcaaaa ctctaatcta gttgataaag 4680

catcgctaaa ccaagactac aaccgctttg cgaacgtact tggtcaacgt gaactggcta 4740 

agctagaaaa atcaggtatt gactaccgcc taccagtacc aggtgcaaaa gtagaagatg 4800 

gtaagctagc aatgaacgtt cagttccctg gcgtaacgct tcaatactct ctggatggtg 4860 

agaactggtt gacttatgca gacaacgctc gtccaaatgt aactggtgaa gtcttcatcc 4920 

gctcggtatc tgcgacaggt gagaaggtaa gccgtatcac tagcgtgaaa taatagcgct 4980 

cagtattcac taaaatcata gttccttact caaagccctc aacttatgtt gggggctttg 5040 

tttatttttc ttcggaaaat aagcgtgatc atgcgacagg tttgatgaca aaaaattagc 5100 

gcaagaagac aaaaatcacc ttgcgctaat gctctgttac aggtcactaa taccatctaa 5160 

gtagttgatt catagtgact gcatatgttg tgttttacag tattatgtag tctgtttttt 5220 

atgcaaaatc taatttaata tattgatatt tatatcattt tacgtttctc gttcagcttt 5280 

tttatactaa gttggcatta taaaaaagca ttgcttatca atttgttgca acgaacaggt 5340 

cactatcagt caaaataaaa tcattatttg atttcaattt tgtcccactc cctgcctctg 5400 

tcatcacgat actgtgatgc catggtgtcc gacttatgcc cgagaagatg ttgagcaaac 5460

ttatcgctta tctgcttctc atagagtctt gcagacaaac tgcgcaactc gtgaaaggta 5520 

ggcgcccccg gtcgaatttg ctttcgaatt tctgccattc atccgcttat tatcacttat 5580 

tcaggcgtag caccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg 5640 

ccctgccact catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca 5700 

tcacagacgg catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta 5760

taatatttgc ccatggtgaa aacgggggcg aagaagttgt ccatattggc cacgtttaaa 5820 

tcaaaactgg tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac 5880 

cctttaggga aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt 5940 
agaaactgcc ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc 6000

tcatggaaaa cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc 6060 

attgccatac g 6071

<210> 13
<211> 5641 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pJMF3 
<400> 13 

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840 

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740 

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gcattaatgt gagttagctc 1800 

actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt 1860 

gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc caagcttgca 1920 

tgcctgcagg tcgactctag aggatccccg ggtaccgagc tctgagcaac aagttgtaaa 1980 

ctcactggct gataaccttg atatccaata tgaagtgtta actaaccatg gtgctaacga 2040 

aggtcttgcg tgccaagata tgggcgcaga atgggcttct tgtaacaaag taaacatgac 2100 

gcttgttaac caaggtgaag ctgttgactc aaaagattgg gctatttact tccacagcat 2160 

tcgtctgatt ctggatgttg acaacgagca gttcaaaatc tctcgtgtaa cgggtgacct 2220 

acataagcta gaaccaacag ataagtttga cggcttcgct gccggtgaag aggttgttct 2280 

tccattggtt ggtgaatact ggcaactatt tgaaactgac ttcatgccgg gtgcattcgt 2340 

ttctgctcca aacgcagaac ctaagatgat tgcttctcta aatactgaag atgttgcgtc 2400 

ttttgtgacg ggtcttgaag gtaacaacct aaaacgtaca ccagatgaca acaatgtatt 2460 

tgcaaacgct gtgtctcgtt ttgagaaaaa cgaagaccta gcaacacaag acgtatcaac 2520 

cacgttacta ccaacaccaa tgcacgttga agcgggtaaa ggcaaagtag atatcgcgga 2580 

tggtattgcg ctgcctaaag acgcattcga tgcgactcag ttcgcagcga ttcaagatcg 2640 

tgcagaagtg gtaggtgtgg acgttcgtgg tgatcttcct gtaagcatca ctgttgttcc 2700 
tgcagacttc accggtgaat tagcaaaatc tggtgcttac gaaatgagca tcaaaggcga 2760

cggtattgtg attaaagcgt tcgaccaagc aggcgctttc tacgcagtac aatctatctt 2820 

tggcctggta gatagccaaa atgctgattc tctaccacaa ctgtctatta aagatgcgcc 2880 

tcgttttgat taccgtggtg tgatggtgga tgtggctcgt aacttccact ctaaggacgc 2940 

aatccttgca acgctagacc aaatggcagc gtacaagatg aacaaacttc accttcacct 3000 

aaccgatgat gaaggctggc gtttagaaat cccgggtctg cctgagctga cagaagtggg 3060 

tgctaaccgt tgtttcgata cacaagagaa aagctgttta ctgcctcagc ttggctctgg 3120 

tccaacgaca gacaactttg gctctggcta cttcagcaaa gcagactacg tggaaatctt 3180 

gaaatacgcg aaagcacgta acattgaagt gattccagaa atcgatatgc cagctcacgc 3240 

tcgtgcagca gtagtatcaa tggaagctcg ttacgaccgc ctaatggaag aaggtaaaga 3300 

agctgaagcg aacgaatacc gtctgatgga tcctcaagat acatcaaacg taacgacggt 3360 

tcagttctac aataagcaaa gcttcatcaa cccatgtatg gaatcttcaa ctcgctttgt 3420 

tgataaggtg atttcagaag tggcagcaat gcaccaagaa gctggcgctc cactaacaac 3480 

ttggcacttc ggtggtgacg aagcgaagaa catcaagcta ggtgctggtt tccaagacgt 3540 

taacgcagaa gataaagtaa gctggaaagg cacgattgac ctgtctaaac aagacaagcc 3600 

gtttgcacag tctccacaat gtcagacgct aatcacagat ggcacagtca gtgactttgc 3660 

tcacctacca agccacttcg cggaagaagt gtcgaagatt gttgctgaga aaggcattcc 3720 

aaacttccaa gcttggcaag atggtttgaa atacagtgac ggcgaaaaag cgttcgctac 3780 

agaaaatact cgcgtaaact tctgggacgt tctgtactgg ggcggtactt cctcagtgta 3840

cgagtggtct aagaaaggtt acgacgtgat tgtttctaac ccagattacg tgtacatgga 3900 

tatgccatac gaagttgacc cgaaagagcg tggttactac tgggcaacac gtgcaacgga 3960 

tactcgtaag atgtttggct ttgcaccaga gaacatgcct caaaacgcag aaacttctgt 4020 

agatcgcgat ggcaatggct ttactggtaa aggtgaaatc gaagcgaaac ctttctacgg 4080 

tctatctgca caactttggt ctgagacagt acgtaacgac gagcaatacg agtacatggt 4140 

attccctcgc gtcctcgctg ctgctcagcg tgcatggcac cgtgctgact gggaaaacga 4200 

ctacaaagtt ggtgttgagt actcgcaaaa ctctaatcta gttgataaag catcgctaaa 4260 

ccaagactac aaccgctttg cgaacgtact tggtcaacgt gaactggcta agctagaaaa 4320 

atcaggtatt gactaccgcc taccagtacc aggtgcaaaa gtagaagatg gtaagctagc 4380 

aatgaacgtt cagttccctg gcgtaacgct tcaatactct ctggatggtg agaactggtt 4440

gacttatgca gacaacgctc gtccaaatgt aactggtgaa gtcttcatcc gctcggtatc 4500 

tgcgacaggt gagaaggtaa gccgtatcac tagcgtgaaa taatagcgct cagtattcac 4560 

taaaatcata gttccttact caaagccctc aacttatgtt gggggctttg tttatttttc 4620 

ttcggaaaat aagcgtgatc atgcgacagg tttgatgaca aaaaattagc gcaagaagac 4680 

aaaaatcacc ttgcgctaat gctctgttac aggtcactaa taccatctaa gtagttgatt 4740 

catagtgact gcatatgttg tgttttacag tattatgtag tctgtttttt atgcaaaatc 4800 

taatttaata tattgatatt tatatcattt tacgtttctc gttcagcttt tttatactaa 4860 

gttggcatta taaaaaagca ttgcttatca atttgttgca acgaacaggt cactatcagt 4920

caaaataaaa tcattatttg atttcaattt tgtcccactc cctgcctctg tcatcacgat 4980

actgtgatgc catggtgtcc gacttatgcc cgagaagatg ttgagcaaac ttatcgctta 5040 

tctgcttctc atagagtctt gcagacaaac tgcgcaactc gtgaaaggta ggcgcccccg 5100 

gtcgaatttg ctttcgaatt tctgccattc atccgcttat tatcacttat tcaggcgtag 5160 

caccaggcgt ttaagggcac caataactgc cttaaaaaaa ttacgccccg ccctgccact 5220 

catcgcagta ctgttgtaat tcattaagca ttctgccgac atggaagcca tcacagacgg 5280 

catgatgaac ctgaatcgcc agcggcatca gcaccttgtc gccttgcgta taatatttgc 5340

ccatggtgaa aacgggggcg aagaagttgt ccatattggc cacgtttaaa tcaaaactgg 5400 

tgaaactcac ccagggattg gctgagacga aaaacatatt ctcaataaac cctttaggga 5460

aataggccag gttttcaccg taacacgcca catcttgcga atatatgtgt agaaactgcc 5520 

ggaaatcgtc gtggtattca ctccagagcg atgaaaacgt ttcagtttgc tcatggaaaa 5580 

cggtgtaaca agggtgaaca ctatcccata tcaccagctc accgtctttc attgccatac 5640 

g 5641 
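
The restriction sites labeled in Figure 1 (SphI, AseI, SalI, KpnI, SstI) can be located in the corresponding region of the pJMF3 sequence with a short scan. The following Python sketch is illustrative only: the recognition sequences are the commonly published ones for these enzymes and are not stated in this listing, the fragment is the lac promoter and polylinker region copied from SEQ ID NO: 13, and the names `SITES`, `FRAGMENT` and `find_sites` are hypothetical.

```python
import re

# Commonly published recognition sequences (an assumption; not given in this listing).
SITES = {
    "SphI": "gcatgc",
    "AseI": "attaat",
    "SalI": "gtcgac",
    "KpnI": "ggtacc",
    "SstI": "gagctc",
}

# lac promoter / polylinker region of pJMF3, copied from SEQ ID NO: 13.
FRAGMENT = (
    "gcatgcattaatgtgagttagctc"
    "actcattaggcaccccaggctttacactttatgcttccggctcgtatgttgtgtggaatt"
    "gtgagcggataacaatttcacacaggaaacagctatgaccatgattacgccaagcttgca"
    "tgcctgcaggtcgactctagaggatccccgggtaccgagctctgagcaacaagttgtaaa"
)


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each recognition sequence."""
    seq = seq.lower()
    hits = {}
    for name, pattern in sites.items():
        # A lookahead reports overlapping occurrences as well.
        hits[name] = [m.start() for m in re.finditer("(?=" + pattern + ")", seq)]
    return hits


if __name__ == "__main__":
    for enzyme, positions in find_sites(FRAGMENT).items():
        print(enzyme, positions)
```

Positions are 0-based offsets within the fragment, not coordinates in the full plasmid sequence.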

<210> 14 
<211> 5670 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> pJMF4
<400> 14

gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60 

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120 

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180 

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240 

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300 

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360 

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420 

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540 

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtgcggc 600 

cgcactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 660 

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 720 

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 780 

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 840

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 900 

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt cccctggcgg 960 

ctccctcgtg cgctctcctg ttcctgcctt tcggtttacc ggtgtcattc cgctgttatg 1020 

gccgcgtttg tctcattcca cgcctgacac tcagttccgg gtaggcagtt cgctccaagc 1080 

tggactgtat gcacgaaccc cccgttcagt ccgaccgctg cgccttatcc ggtaactatc 1140 

gtcttgagtc caacccggaa agacatgcaa aagcaccact ggcagcagcc actggtaatt 1200 

gatttagagg agttagtctt gaagtcatgc gccggttaag gctaaactga aaggacaagt 1260 

tttggtgact gcgctcctcc aagccagtta cctcggttca aagagttggt agctcagaga 1320 

accttcgaaa aaccgccctg caaggcggtt ttttcgtttt cagagcaaga gattacgcgc 1380 

agaccaaaac gatctcaaga agatcatctt atgcggccgc atcagataaa atatttctag 1440 

atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 1500 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 1560 

tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 1620 

ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 1680 

tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 1740 

aactcttcct gtcgtcatat ctacaagcca tcccccgcat gcattaatgt gagttagctc 1800 

actcattagg caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt 1860 

gtgagcggat aacaatttca cacaggaaac agctatgacc atgattacgc caagcttgca 1920 

tgcctgcagg tcgactctag aggatccccg ggtaccgagc tctgagcaac aagttgtaaa 1980 

ctcactggct gataaccttg atatccaata tgaagtgtta actaaccatg gtgctaacga 2040

aggtcttgcg tgccaagata tgggcgcaga atgggcttct tgtaacaaag taaacatgac 2100 

gcttgttaac caaggtgaag ctgttgactc aaaagattgg gctatttact tccacagcat 2160 

tcgtctgatt ctggatgttg acaacgagca gttcaaaatc tctcgtgtaa cgggtgacct 2220 

acataagcta gaaccaacag ataagtttga cggcttcgct gccggtgaag aggttgttct 2280 

tccattggtt ggtgaatact ggcaactatt tgaaactgac ttcatgccgg gtgcattcgt 2340 

ttctgctcca aacgcagaac ctaagatgat tgcttctcta aatactgaag atgttgcgtc 2400 

ttttgtgacg ggtcttgaag gtaacaacct aaaacgtaca ccagatgaca acaatgtatt 2460

tgcaaacgct gtgtctcgtt ttgagaaaaa cgaagaccta gcaacacaag acgtatcaac 2520 

cacgttacta ccaacaccaa tgcacgttga agcgggtaaa ggcaaagtag atatcgcgga 2580 

tggtattgcg ctgcctaaag acgcattcga tgcgactcag ttcgcagcga ttcaagatcg 2640 

tgcagaagtg gtaggtgtgg acgttcgtgg tgatcttcct gtaagcatca ctgttgttcc 2700 

tgcagacttc accggtgaat tagcaaaatc tggtgcttac gaaatgagca tcaaaggcga 2760 

cggtattgtg attaaagcgt tcgaccaagc aggcgctttc tacgcagtac aatctatctt 2820 

tggcctggta gatagccaaa atgctgattc tctaccacaa ctgtctatta aagatgcgcc 2880 

tcgttttgat taccgtggtg tgatggtgga tgtggctcgt aacttccact ctaaggacgc 2940 

aatccttgca acgctagacc aaatggcagc gtacaagatg aacaaacttc accttcacct 3000 

aaccgatgat gaaggctggc gtttagaaat cccgggtctg cctgagctga cagaagtggg 3060 

tgctaaccgt tgtttcgata cacaagagaa aagctgttta ctgcctcagc ttggctctgg 3120 

tccaacgaca gacaactttg gctctggcta cttcagcaaa gcagactacg tggaaatctt 3180 

gaaatacgcg aaagcacgta acattgaagt gattccagaa atcgatatgc cagctcacgc 3240 
tcgtgcagca gtagtatcaa tggaagctcg ttacgaccgc ctaatggaag aaggtaaaga 3300

agctgaagcg aacgaatacc gtctgatgga tcctcaagat acatcaaacg taacgacggt 3360 

tcagttctac aataagcaaa gcttcatcaa cccatgtatg gaatcttcaa ctcgctttgt 3420 

tgataaggtg atttcagaag tggcagcaat gcaccaagaa gctggcgctc cactaacaac 3480

ttggcacttc ggtggtgacg aagcgaagaa catcaagcta ggtgctggtt tccaagacgt 3540 

taacgcagaa gataaagtaa gctggaaagg cacgattgac ctgtctaaac aagacaagcc 3600 

gtttgcacag tctccacaat gtcagacgct aatcacagat ggcacagtca gtgactttgc 3660 

tcacctacca agccacttcg cggaagaagt gtcgaagatt gttgctgaga aaggcattcc 3720 

aaacttccaa gcttggcaag atggtttgaa atacagtgac ggcgaaaaag cgttcgctac 3780 

agaaaatact cgcgtaaact tctgggacgt tctgtactgg ggcggtactt cctcagtgta 3840 

cgagtggtct aagaaaggtt acgacgtgat tgtttctaac ccagattacg tgtacatgga 3900 

tatgccatac gaagttgacc cgaaagagcg tggttactac tgggcaacac gtgcaacgga 3960 

tactcgtaag atgtttggct ttgcaccaga gaacatgcct caaaacgcag aaacttctgt 4020 

agatcgcgat ggcaatggct ttactggtaa aggtgaaatc gaagcgaaac ctttctacgg 4080

tctatctgca caactttggt ctgagacagt acgtaacgac gagcaatacg agtacatggt 4140 

attccctcgc gtcctcgctg ctgctcagcg tgcatggcac cgtgctgact gggaaaacga 4200 

ctacaaagtt ggtgttgagt actcgcaaaa ctctaatcta gttgataaag catcgctaaa 4260 

ccaagactac aaccgctttg cgaacgtact tggtcaacgt gaactggcta agctagaaaa 4320 

atcaggtatt gactaccgcc taccagtacc aggtgcaaaa gtagaagatg gtaagctagc 4380 

aatgaacgtt cagttccctg gcgtaacgct tcaatactct ctggatggtg agaactggtt 4440 

gacttatgca gacaacgctc gtccaaatgt aactggtgaa gtcttcatcc gctcggtatc 4500 

tgcgacaggt gagaaggtaa gccgtatcac tagcgtgaaa taatagcgct cagtattcac 4560 

taaaatcata gttccttact caaagccctc aacttatgtt gggggctttg tttatttttc 4620 

ttcggaaaat aagcgtgatc ccccgggggc gcctaccttt cacgagttgc gcagtttgtc 4680 

tgcaagactc tatgagaagc agataagcga taagtttgct caacatcttc tcgggcataa 4740

gtcggacacc atggcatcac agtatcgtga tgacagaggc agggagtggg acaaaattga 4800 

aatcaaataa tgattttatt ttgactgata gtgacctgtt cgttgcaaca aattgataag 4860 

caatgctttt ttataatgcc aacttagtat aaaaaagctg aacgagaaac gtaaaatgat 4920 

ataaatatca atatattaaa ttagattttg cataaaaaac agactacata atactgtaaa 4980 

acacaacata tgcagtcact atgaatcaac tacttagatg gtattagtga cctgtaacag 5040 

agcattagcg caaggtgatt tttgtcttct tgcgctaatt ttttgtcatc aaacctgtcg 5100 

catgatcatg gggctgcagg aattcgatgg tcgaatttgc tttcgaattt ctgccattca 5160 

tccgcttatt atcacttatt caggcgtagc accaggcgtt taagggcacc aataactgcc 5220 

ttaaaaaaat tacgccccgc cctgccactc atcgcagtac tgttgtaatt cattaagcat 5280 

tctgccgaca tggaagccat cacagacggc atgatgaacc tgaatcgcca gcggcatcag 5340

caccttgtcg ccttgcgtat aatatttgcc catggtgaaa acgggggcga agaagttgtc 5400 

catattggcc acgtttaaat caaaactggt gaaactcacc cagggattgg ctgagacgaa 5460

aaacatattc tcaataaacc ctttagggaa ataggccagg ttttcaccgt aacacgccac 5520 

atcttgcgaa tatatgtgta gaaactgccg gaaatcgtcg tggtattcac tccagagcga 5580 

tgaaaacgtt tcagtttgct catggaaaac ggtgtaacaa gggtgaacac tatcccatat 5640 

caccagctca ccgtctttca ttgccatacg 5670 

<210> 15 
<211> 238 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> LacZ α/chitobiase Fusion

<220> 
<221> CDS 

<222> (119)...(240)

<221> -35_signal
<222> (44)...(50)
<223> Lac promoter

<221> -10_signal
<222> (69)...(74)
<223> Lac promoter

<221> protein_bind
<222> (81)...(107)
<223> Lac repressor binding site

<221> protein_bind
<222> (7)...(34)
<223> CAP-cAMP binding site

<400> 15 

gcatgcatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt 60 
ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagct 118 
atg acc atg att acg cca agc ttg cat gcc tgc agg tcg act cta gag 166
Met Thr Met Ile Thr Pro Ser Leu His Ala Cys Arg Ser Thr Leu Glu
1               5                   10                  15

gat ccc cgg gta ccg agc tct gag caa caa gtt gta aac tca ctg gct 214
Asp Pro Arg Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala
            20                  25                  30

gat aac ctt gat atc caa tat gaa 238
Asp Asn Leu Asp Ile Gln Tyr Glu
        35                  40



<210> 16 
<211> 40 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> LacZ α/chitobiase Fusion
<400> 16 

Met Thr Met Ile Thr Pro Ser Leu His Ala Cys Arg Ser Thr Leu Glu
1               5                   10                  15

Asp Pro Arg Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala
            20                  25                  30

Asp Asn Leu Asp Ile Gln Tyr Glu
        35                  40

<210> 17 
<211> 22 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> dnaA/chitobiase Fusion 
<400> 17 

Met Ser Leu Ser Leu Trp Gln Gln Cys Leu Ala Arg Leu Gln Asp Glu
1               5                   10                  15

Leu Val Pro Ser Ser Glu
            20

<210> 18 
<211> 2643 
<212> DNA

<213> Artificial Sequence 



<220> 

<221> CDS 

<222> (1)...(2643)

<223> dnaA/Chitobiase Fusion 

<400> 18 

gtg tca ctt tcg ctt tgg cag cag tgt ctt gcc cga ttg cag gat gag 48
Met Ser Leu Ser Leu Trp Gln Gln Cys Leu Ala Arg Leu Gln Asp Glu
1               5                   10                  15

ttg gta ccg agc tct gag caa caa gtt gta aac tca ctg gct gat aac 96
Leu Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala Asp Asn
            20                  25                  30

ctt gat atc caa tat gaa gtg tta act aac cat ggt gct aac gaa ggt 144
Leu Asp Ile Gln Tyr Glu Val Leu Thr Asn His Gly Ala Asn Glu Gly
        35                  40                  45

ctt gcg tgc caa gat atg ggc gca gaa tgg gct tct tgt aac aaa gta 192
Leu Ala Cys Gln Asp Met Gly Ala Glu Trp Ala Ser Cys Asn Lys Val
    50                  55                  60

aac atg acg ctt gtt aac caa ggt gaa gct gtt gac tca aaa gat tgg 240
Asn Met Thr Leu Val Asn Gln Gly Glu Ala Val Asp Ser Lys Asp Trp
65                  70                  75                  80

gct att tac ttc cac agc att cgt ctg att ctg gat gtt gac aac gag 288
Ala Ile Tyr Phe His Ser Ile Arg Leu Ile Leu Asp Val Asp Asn Glu
                85                  90                  95

cag ttc aaa atc tct cgt gta acg ggt gac cta cat aag cta gaa cca 336
Gln Phe Lys Ile Ser Arg Val Thr Gly Asp Leu His Lys Leu Glu Pro
            100                 105                 110

aca gat aag ttt gac ggc ttc gct gcc ggt gaa gag gtt gtt ctt cca 384
Thr Asp Lys Phe Asp Gly Phe Ala Ala Gly Glu Glu Val Val Leu Pro
        115                 120                 125

ttg gtt ggt gaa tac tgg caa cta ttt gaa act gac ttc atg ccg ggt 432
Leu Val Gly Glu Tyr Trp Gln Leu Phe Glu Thr Asp Phe Met Pro Gly
    130                 135                 140

gca ttc gtt tct gct cca aac gca gaa cct aag atg att gct tct cta 480
Ala Phe Val Ser Ala Pro Asn Ala Glu Pro Lys Met Ile Ala Ser Leu
145                 150                 155                 160

aat act gaa gat gtt gcg tct ttt gtg acg ggt ctt gaa ggt aac aac 528
Asn Thr Glu Asp Val Ala Ser Phe Val Thr Gly Leu Glu Gly Asn Asn
                165                 170                 175

cta aaa cgt aca cca gat gac aac aat gta ttt gca aac gct gtg tct 576
Leu Lys Arg Thr Pro Asp Asp Asn Asn Val Phe Ala Asn Ala Val Ser
            180                 185                 190

cgt ttt gag aaa aac gaa gac cta gca aca caa gac gta tca acc acg 624
Arg Phe Glu Lys Asn Glu Asp Leu Ala Thr Gln Asp Val Ser Thr Thr
        195                 200                 205

tta cta cca aca cca atg cac gtt gaa gcg ggt aaa ggc aaa gta gat 672
Leu Leu Pro Thr Pro Met His Val Glu Ala Gly Lys Gly Lys Val Asp
    210                 215                 220

atc gcg gat ggt att gcg ctg cct aaa gac gca ttc gat gcg act cag 720
Ile Ala Asp Gly Ile Ala Leu Pro Lys Asp Ala Phe Asp Ala Thr Gln
225                 230                 235                 240

ttc gca gcg att caa gat cgt gca gaa gtg gta ggt gtg gac gtt cgt 768
Phe Ala Ala Ile Gln Asp Arg Ala Glu Val Val Gly Val Asp Val Arg
                245                 250                 255

ggt gat ctt cct gta agc atc act gtt gtt cct gca gac ttc acc ggt 816
Gly Asp Leu Pro Val Ser Ile Thr Val Val Pro Ala Asp Phe Thr Gly
            260                 265                 270

gaa tta gca aaa tct ggt gct tac gaa atg agc atc aaa ggc gac ggt 864
Glu Leu Ala Lys Ser Gly Ala Tyr Glu Met Ser Ile Lys Gly Asp Gly
        275                 280                 285

att gtg att aaa gcg ttc gac caa gca ggc gct ttc tac gca gta caa 912
Ile Val Ile Lys Ala Phe Asp Gln Ala Gly Ala Phe Tyr Ala Val Gln
    290                 295                 300

tct atc ttt ggc ctg gta gat agc caa aat gct gat tct cta cca caa 960
Ser Ile Phe Gly Leu Val Asp Ser Gln Asn Ala Asp Ser Leu Pro Gln
305                 310                 315                 320

ctg tct att aaa gat gcg cct cgt ttt gat tac cgt ggt gtg atg gtg 1008
Leu Ser Ile Lys Asp Ala Pro Arg Phe Asp Tyr Arg Gly Val Met Val
                325                 330                 335

gat gtg gct cgt aac ttc cac tct aag gac gca atc ctt gca acg cta 1056
Asp Val Ala Arg Asn Phe His Ser Lys Asp Ala Ile Leu Ala Thr Leu
            340                 345                 350

gac caa atg gca gcg tac aag atg aac aaa ctt cac ctt cac cta acc 1104
Asp Gln Met Ala Ala Tyr Lys Met Asn Lys Leu His Leu His Leu Thr
        355                 360                 365

gat gat gaa ggc tgg cgt tta gaa atc ccg ggt ctg cct gag ctg aca 1152
Asp Asp Glu Gly Trp Arg Leu Glu Ile Pro Gly Leu Pro Glu Leu Thr
    370                 375                 380

gaa gtg ggt gct aac cgt tgt ttc gat aca caa gag aaa agc tgt tta 1200
Glu Val Gly Ala Asn Arg Cys Phe Asp Thr Gln Glu Lys Ser Cys Leu
385                 390                 395                 400

ctg cct cag ctt ggc tct ggt cca acg aca gac aac ttt ggc tct ggc 1248
Leu Pro Gln Leu Gly Ser Gly Pro Thr Thr Asp Asn Phe Gly Ser Gly
                405                 410                 415
tac ttc agc aaa gca gac tac gtg gaa atc ttg aaa tac gcg aaa gca 1296
Tyr Phe Ser Lys Ala Asp Tyr Val Glu Ile Leu Lys Tyr Ala Lys Ala
            420                 425                 430

cgt aac att gaa gtg att cca gaa atc gat atg cca gct cac gct cgt 1344
Arg Asn Ile Glu Val Ile Pro Glu Ile Asp Met Pro Ala His Ala Arg
        435                 440                 445

gca gca gta gta tca atg gaa gct cgt tac gac cgc cta atg gaa gaa 1392
Ala Ala Val Val Ser Met Glu Ala Arg Tyr Asp Arg Leu Met Glu Glu
    450                 455                 460

ggt aaa gaa gct gaa gcg aac gaa tac cgt ctg atg gat cct caa gat 1440
Gly Lys Glu Ala Glu Ala Asn Glu Tyr Arg Leu Met Asp Pro Gln Asp
465                 470                 475                 480

aca tca aac gta acg acg gtt cag ttc tac aat aag caa agc ttc atc 1488
Thr Ser Asn Val Thr Thr Val Gln Phe Tyr Asn Lys Gln Ser Phe Ile
                485                 490                 495

aac cca tgt atg gaa tct tca act cgc ttt gtt gat aag gtg att tca 1536
Asn Pro Cys Met Glu Ser Ser Thr Arg Phe Val Asp Lys Val Ile Ser
            500                 505                 510

gaa gtg gca gca atg cac caa gaa gct ggc gct cca cta aca act tgg 1584
Glu Val Ala Ala Met His Gln Glu Ala Gly Ala Pro Leu Thr Thr Trp
        515                 520                 525

cac ttc ggt ggt gac gaa gcg aag aac atc aag cta ggt gct ggt ttc 1632
His Phe Gly Gly Asp Glu Ala Lys Asn Ile Lys Leu Gly Ala Gly Phe
    530                 535                 540

caa gac gtt aac gca gaa gat aaa gta agc tgg aaa ggc acg att gac 1680
Gln Asp Val Asn Ala Glu Asp Lys Val Ser Trp Lys Gly Thr Ile Asp
545                 550                 555                 560

ctg tct aaa caa gac aag ccg ttt gca cag tct cca caa tgt cag acg 1728
Leu Ser Lys Gln Asp Lys Pro Phe Ala Gln Ser Pro Gln Cys Gln Thr
                565                 570                 575

cta atc aca gat ggc aca gtc agt gac ttt gct cac cta cca agc cac 1776
Leu Ile Thr Asp Gly Thr Val Ser Asp Phe Ala His Leu Pro Ser His
            580                 585                 590

ttc gcg gaa gaa gtg tcg aag att gtt gct gag aaa ggc att cca aac 1824
Phe Ala Glu Glu Val Ser Lys Ile Val Ala Glu Lys Gly Ile Pro Asn
        595                 600                 605

ttc caa gct tgg caa gat ggt ttg aaa tac agt gac ggc gaa aaa gcg 1872
Phe Gln Ala Trp Gln Asp Gly Leu Lys Tyr Ser Asp Gly Glu Lys Ala
    610                 615                 620

ttc gct aca gaa aat act cgc gta aac ttc tgg gac gtt ctg tac tgg 1920
Phe Ala Thr Glu Asn Thr Arg Val Asn Phe Trp Asp Val Leu Tyr Trp
625                 630                 635                 640

ggc ggt act tcc tca gtg tac gag tgg tct aag aaa ggt tac gac gtg 1968
Gly Gly Thr Ser Ser Val Tyr Glu Trp Ser Lys Lys Gly Tyr Asp Val
                645                 650                 655

att gtt tct aac cca gat tac gtg tac atg gat atg cca tac gaa gtt 2016
Ile Val Ser Asn Pro Asp Tyr Val Tyr Met Asp Met Pro Tyr Glu Val
            660                 665                 670

gac ccg aaa gag cgt ggt tac tac tgg gca aca cgt gca acg gat act 2064
Asp Pro Lys Glu Arg Gly Tyr Tyr Trp Ala Thr Arg Ala Thr Asp Thr
        675                 680                 685

cgt aag atg ttt ggc ttt gca cca gag aac atg cct caa aac gca gaa 2112
Arg Lys Met Phe Gly Phe Ala Pro Glu Asn Met Pro Gln Asn Ala Glu
    690                 695                 700

act tct gta gat cgc gat ggc aat ggc ttt act ggt aaa ggt gaa atc 2160
Thr Ser Val Asp Arg Asp Gly Asn Gly Phe Thr Gly Lys Gly Glu Ile
705                 710                 715                 720

gaa gcg aaa cct ttc tac ggt cta tct gca caa ctt tgg tct gag aca 2208
Glu Ala Lys Pro Phe Tyr Gly Leu Ser Ala Gln Leu Trp Ser Glu Thr
                725                 730                 735

gta cgt aac gac gag caa tac gag tac atg gta ttc cct cgc gtc ctc 2256
Val Arg Asn Asp Glu Gln Tyr Glu Tyr Met Val Phe Pro Arg Val Leu
            740                 745                 750

gct gct gct cag cgt gca tgg cac cgt gct gac tgg gaa aac gac tac 2304
Ala Ala Ala Gln Arg Ala Trp His Arg Ala Asp Trp Glu Asn Asp Tyr
        755                 760                 765

aaa gtt ggt gtt gag tac tcg caa aac tct aat cta gtt gat aaa gca 2352
Lys Val Gly Val Glu Tyr Ser Gln Asn Ser Asn Leu Val Asp Lys Ala
    770                 775                 780

tcg cta aac caa gac tac aac cgc ttt gcg aac gta ctt ggt caa cgt 2400
Ser Leu Asn Gln Asp Tyr Asn Arg Phe Ala Asn Val Leu Gly Gln Arg
785                 790                 795                 800

gaa ctg gct aag cta gaa aaa tca ggt att gac tac cgc cta cca gta 2448
Glu Leu Ala Lys Leu Glu Lys Ser Gly Ile Asp Tyr Arg Leu Pro Val
                805                 810                 815

cca ggt gca aaa gta gaa gat ggt aag cta gca atg aac gtt cag ttc 2496
Pro Gly Ala Lys Val Glu Asp Gly Lys Leu Ala Met Asn Val Gln Phe
            820                 825                 830

cct ggc gta acg ctt caa tac tct ctg gat ggt gag aac tgg ttg act 2544
Pro Gly Val Thr Leu Gln Tyr Ser Leu Asp Gly Glu Asn Trp Leu Thr
        835                 840                 845

tat gca gac aac gct cgt cca aat gta act ggt gaa gtc ttc atc cgc 2592
Tyr Ala Asp Asn Ala Arg Pro Asn Val Thr Gly Glu Val Phe Ile Arg
    850                 855                 860

tcg gta tct gcg aca ggt gag aag gta agc cgt atc act agc gtg aaa 2640
Ser Val Ser Ala Thr Gly Glu Lys Val Ser Arg Ile Thr Ser Val Lys
865                 870                 875                 880

taa 2643
Met Ser Leu Ser Leu Trp Gln Gln Cys Leu Ala Arg Leu Gln Asp Glu
1               5                   10                  15

Leu Val Pro Ser Ser Glu Gln Gln Val Val Asn Ser Leu Ala Asp Asn
            20                  25                  30

Leu Asp Ile Gln Tyr Glu Val Leu Thr Asn His Gly Ala Asn Glu Gly
        35                  40                  45

Leu Ala Cys Gln Asp Met Gly Ala Glu Trp Ala Ser Cys Asn Lys Val
    50                  55                  60

Asn Met Thr Leu Val Asn Gln Gly Glu Ala Val Asp Ser Lys Asp Trp
65                  70                  75                  80

Ala Ile Tyr Phe His Ser Ile Arg Leu Ile Leu Asp Val Asp Asn Glu
                85                  90                  95

Gln Phe Lys Ile Ser Arg Val Thr Gly Asp Leu His Lys Leu Glu Pro
            100                 105                 110

Thr Asp Lys Phe Asp Gly Phe Ala Ala Gly Glu Glu Val Val Leu Pro
        115                 120                 125

Leu Val Gly Glu Tyr Trp Gln Leu Phe Glu Thr Asp Phe Met Pro Gly
    130                 135                 140

Ala Phe Val Ser Ala Pro Asn Ala Glu Pro Lys Met Ile Ala Ser Leu
145                 150                 155                 160

Asn Thr Glu Asp Val Ala Ser Phe Val Thr Gly Leu Glu Gly Asn Asn
                165                 170                 175

Leu Lys Arg Thr Pro Asp Asp Asn Asn Val Phe Ala Asn Ala Val Ser
            180                 185                 190

Arg Phe Glu Lys Asn Glu Asp Leu Ala Thr Gln Asp Val Ser Thr Thr
        195                 200                 205

Leu Leu Pro Thr Pro Met His Val Glu Ala Gly Lys Gly Lys Val Asp
    210                 215                 220

Ile Ala Asp Gly Ile Ala Leu Pro Lys Asp Ala Phe Asp Ala Thr Gln
225                 230                 235                 240

Phe Ala Ala Ile Gln Asp Arg Ala Glu Val Val Gly Val Asp Val Arg
                245                 250                 255

Gly Asp Leu Pro Val Ser Ile Thr Val Val Pro Ala Asp Phe Thr Gly
            260                 265                 270

Glu Leu Ala Lys Ser Gly Ala Tyr Glu Met Ser Ile Lys Gly Asp Gly
        275                 280                 285

Ile Val Ile Lys Ala Phe Asp Gln Ala Gly Ala Phe Tyr Ala Val Gln
    290                 295                 300

Ser Ile Phe Gly Leu Val Asp Ser Gln Asn Ala Asp Ser Leu Pro Gln
305                 310                 315                 320

Leu Ser Ile Lys Asp Ala Pro Arg Phe Asp Tyr Arg Gly Val Met Val
                325                 330                 335

Asp Val Ala Arg Asn Phe His Ser Lys Asp Ala Ile Leu Ala Thr Leu
            340                 345                 350

Asp Gln Met Ala Ala Tyr Lys Met Asn Lys Leu His Leu His Leu Thr
        355                 360                 365

Asp Asp Glu Gly Trp Arg Leu Glu Ile Pro Gly Leu Pro Glu Leu Thr
    370                 375                 380

Glu Val Gly Ala Asn Arg Cys Phe Asp Thr Gln Glu Lys Ser Cys Leu
385                 390                 395                 400

Leu Pro Gln Leu Gly Ser Gly Pro Thr Thr Asp Asn Phe Gly Ser Gly
                405                 410                 415

Tyr Phe Ser Lys Ala Asp Tyr Val Glu Ile Leu Lys Tyr Ala Lys Ala
            420                 425                 430

Arg Asn Ile Glu Val Ile Pro Glu Ile Asp Met Pro Ala His Ala Arg
        435                 440                 445

Ala Ala Val Val Ser Met Glu Ala Arg Tyr Asp Arg Leu Met Glu Glu
    450                 455                 460

Gly Lys Glu Ala Glu Ala Asn Glu Tyr Arg Leu Met Asp Pro Gln Asp
465                 470                 475                 480

Thr Ser Asn Val Thr Thr Val Gln Phe Tyr Asn Lys Gln Ser Phe Ile
                485                 490                 495

Asn Pro Cys Met Glu Ser Ser Thr Arg Phe Val Asp Lys Val Ile Ser
            500                 505                 510

Glu Val Ala Ala Met His Gln Glu Ala Gly Ala Pro Leu Thr Thr Trp
        515                 520                 525

His Phe Gly Gly Asp Glu Ala Lys Asn Ile Lys Leu Gly Ala Gly Phe
    530                 535                 540

Gln Asp Val Asn Ala Glu Asp Lys Val Ser Trp Lys Gly Thr Ile Asp
545                 550                 555                 560

Leu Ser Lys Gln Asp Lys Pro Phe Ala Gln Ser Pro Gln Cys Gln Thr
                565                 570                 575

Leu Ile Thr Asp Gly Thr Val Ser Asp Phe Ala His Leu Pro Ser His
            580                 585                 590

Phe Ala Glu Glu Val Ser Lys Ile Val Ala Glu Lys Gly Ile Pro Asn
        595                 600                 605

Phe Gln Ala Trp Gln Asp Gly Leu Lys Tyr Ser Asp Gly Glu Lys Ala
    610                 615                 620

Phe Ala Thr Glu Asn Thr Arg Val Asn Phe Trp Asp Val Leu Tyr Trp
625                 630                 635                 640

Gly Gly Thr Ser Ser Val Tyr Glu Trp Ser Lys Lys Gly Tyr Asp Val
                645                 650                 655

Ile Val Ser Asn Pro Asp Tyr Val Tyr Met Asp Met Pro Tyr Glu Val
            660                 665                 670

Asp Pro Lys Glu Arg Gly Tyr Tyr Trp Ala Thr Arg Ala Thr Asp Thr
        675                 680                 685

Arg Lys Met Phe Gly Phe Ala Pro Glu Asn Met Pro Gln Asn Ala Glu
    690                 695                 700

Thr Ser Val Asp Arg Asp Gly Asn Gly Phe Thr Gly Lys Gly Glu Ile
705                 710                 715                 720

Glu Ala Lys Pro Phe Tyr Gly Leu Ser Ala Gln Leu Trp Ser Glu Thr
                725                 730                 735

Val Arg Asn Asp Glu Gln Tyr Glu Tyr Met Val Phe Pro Arg Val Leu
            740                 745                 750

Ala Ala Ala Gln Arg Ala Trp His Arg Ala Asp Trp Glu Asn Asp Tyr
        755                 760                 765

Lys Val Gly Val Glu Tyr Ser Gln Asn Ser Asn Leu Val Asp Lys Ala
    770                 775                 780

Ser Leu Asn Gln Asp Tyr Asn Arg Phe Ala Asn Val Leu Gly Gln Arg
785                 790                 795                 800

Glu Leu Ala Lys Leu Glu Lys Ser Gly Ile Asp Tyr Arg Leu Pro Val
                805                 810                 815

Pro Gly Ala Lys Val Glu Asp Gly Lys Leu Ala Met Asn Val Gln Phe
            820                 825                 830

Pro Gly Val Thr Leu Gln Tyr Ser Leu Asp Gly Glu Asn Trp Leu Thr
        835                 840                 845

Tyr Ala Asp Asn Ala Arg Pro Asn Val Thr Gly Glu Val Phe Ile Arg
    850                 855                 860

Ser Val Ser Ala Thr Gly Glu Lys Val Ser Arg Ile Thr Ser Val Lys
865                 870                 875                 880



